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AESTRACT 


This  thesis  presents  the  detailed  design  for  a  dynamic 
linker  suitable  for  microcomputer  operation.  The  design 
exhibits  the  usual  property  of  dynamic  linking  in  that  the 
binding  of  in terprccedure  symbolic  references  to  virtual 
addresses  is  deferred  until  the  symbolic  reference  is  first 
encountered  during  process  execution.  The  design  includes 
the  specification  of  dynamic  linker  modules  and  data 
structures.  Furthermore,  an  overview  of  necessary  operating 
system  support  is  presented  along  with  a  detailed  discussion 
of  all  additional  translator  output  required.  Hardware 
features  desirable  (but  not  necessary}  in  a  dynamic  licking 
environment  are  reviewed. 

Pynamic  linking  without  translator  support  and  unlinking 
of  an  object  (from  a  process  address  space'  are 
investigated.  A  subset  of  the  dynamic  linker  design  (not 
including  the  unlinking  capability)  was  implemented  on  an 
Intel  £060  microprocessor  as  a  demonstration  of  the 
feasibility  of  the  concepts 
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Dynamic  linking  has  been  previous*/  assumed  to  be 
restricted  to  those  computing  systems  that  were  specifically 
designed  to  support  a  dynamic  linker.  The  first  goal  of  this 
thesis  was  to  determine  if  specialized  hardware,  such  as 
found  in  Multics  [11],  is  essential  to  realize  dynamic 
linking.  And,  given  that  specialized  hardware  is  not 
necessary,  the  second  goal  was  to  design  a  linker  compatable 
with  existing  microcomputer  architectures. 

The  design  of  a  dynamic  linker  was  developed  witn  a 
basic  set  of  design  criteria  (Table  1>  established  as 
guidelines.  (A  complete  discussion  of  the  implicatiors  of 
these  criteria  is  delayed  until  the  end  of  this  thesis.'  The 
most  fundamental  criterion  which  characterizes  dynamic 
linking  relates  to  when  an  object  is  bound  to  a  virtual 
address  within  a  process  address  space.  In  the  traditional 
static  environment,  this  binding  occurs  prior  to  program 
execution.  In  a  dynamic  linking  environment,  binding  is 
delayed  until  an  object  is  first  referenced  by  a  process. 
This  capability  allows  tremendous  flexibility  in  the 
development  of  software  systems. 
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TA3IE  1  -  DESIGN  CRITERIA  FOP  A  IINSE? 


1.  Delayed  Einding  -  The  binding  (linking^  of  an 
external  object  to  a  virtual  address  f wi tfci n  a  process 
address  space)  must  rot  take  place  until  the  object  is 
first  referenced  during  program  execution. 

2.  limited  Overhead  -  Subsequent  references  tc  an  object 
(i.e.,  references  following  the  first  reference!  must 
not  impose  excessive  overhead  with  respect  to  process 
execution  speed  and  primary  storage* 

3.  Domain  Independence  -  The  dynamic  linker  must  be 
rorpa table  with  current  secure  operating  system  designs. 
In  a  multidomain  environment,  the  dynamic  linker  must  be 
capable  of  executing  in  the  domain  of  the  calling 
subroutine  (vice  executing  in  the  security  kernel1  ). 

4.  Syntatic  Compatatility  -  The  design  must  allow 
external  objects  to  be  utilized  in  the  same  context  as 
internally  defined  procedures  and  data.  (This  implies 
that  external  objects  can  be  used  as  parameters  subject 
only  to  the  limitations  of  tne  language  syntax.! 

5.  Pure  Object  Code  -  The  dynamic  linker  must  permit  the 
object  code  of  a  procedure  to  remain  pure,  allowing 
sharing  of  procedures  in  a  multiprogramming  environment. 

6.  Hardware  Independence  -  The  design  must  be 
implemen table  on  a  microprocessor  which  does  not  possess 
those  hardware  features  specifically  associated  witn 
dynamic  linking.  Ir.  Multics  [11],  the  features  includ°: 

a.  hardware  segmentation 

b.  demanding  paging 

c.  indirect  addressing  through  memory 

d.  a  linkage  fault  during  indirection 


■'‘It  has  been  shown  [7]  that  it  is  not  necessary  for  the 
dynamic  linker  to  reside  in  the  security  kernel  to  maintain 
system  security. 
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II.  SHCMPT-'Itt 


The  traditional  concept  of  Uniting  end  loading 
involves  one.  or  possibly  two  operating  system  routines  that 
103d  several  distinct  objects  into  memory,  combine  them  into 
one  address  space  (loading),  and  finally  resolve  a...r_ss  g 

between  objects  (Uniting).  Me  end  result  is  an  executable 

program . 

Tbe  static  and  inflexible  functions  carried  out  by  toe 
Uniting  loader  place  undesirable  limitations  on  tr„r*r 
development.  First,  a  program  must  be  intact  (i.e..  contain 
all  objects  required  for  proper  execution)  prior  to  run 
tine.  Second .  if  a  module  is  changed.  toe  whole  program  must 
be  reunited.  Jurtnermore.  a  module  may  be  statically  llnied 
to  several  programs  resulting  In  multiple  copies  of  a  module 
existing  within  the  system.  Dynamic  Unsing  is  proposed  as 

tn  r , t <  e  linkin'^  that  solves  these  protlens. 
an  alternative  to  static  iinn.ii. 

Dvnamic  Uniting  !#.  U.  Ml  o«crs  two  other  major 

advantages  over  static  lUti.x.  First.  */«»»«  »»“«/ 

allows  a  programmer  to  write  and  test  incomplete  programs 
since  one  may  include  in  a  subroutine  a  reference  to  ar  as 
J.,  unwritten  external  object  and,  as  long  as  tse  reference 
ls  cever  executed,  the  program  will  net  experience  a  run 
ure  error.  In  the  field  of  software  development,  this 
feature  Is  advantageous  since  incomplete  modules  nay  still 
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be  tested  individually.  (It  should  be  noted  at  tcis  time 
that  once  the  user  has  a  completely  tested  product,  it  may 
he  desired  to  statically  link  modules  together  to  avoid  tne 
run  time  overhead  associated  with  dynamic  linking.' 

Tne  second  major  advantage  of  dynamic  linking  is  tnat 
modules  of  a  program  need  not  be  venerated  by  the  same 
translator.  For  example,  in  a  dynamic  linking  environment 
one  may  use  FORTRAN  to  do  some  double  precision  scientific 
calculations.  If  the  results  were  then  stored  in  an  external 
data  structure,  they  could  be  displayed  using  a  dynamically 
linked  module  written  in  a  more  suitable  language  fcr  I/C 
formatting  such  as  PL/1.  Because  the  modules  'communicate' 
via  the  external  data  structure,  and  are  dynamically  linked 
to  each  other,  they  need  not  be  from  the  same  translator. 
(Note  that  a  dynamic  linker  does  not  prohibit  such  a 
'heterogenous'  program,  from  executing  tut  may  r.ot  te 
sufficient  in  itself  to  allow  proper  execution.) 

A.  THE  TRADITION AI  LINKING  LOADER 

First  of  all,  the  'linker'  and  the  'leader'  should  te 
considered  separate  operating  system  functions.  Linking  may 
still  be  viewed  as  the  combining  of  several  objects  into  one 
program?  however,  the  loading  process  actually  consist  of 
two  distinct  operations.  The  popular  concept  of  a  loader  is 
one  of  a  static  operation  prior  to  run  time  welch  ta.ces  sere 
object  code  associated  with  a  program  and  'leads'  this  code 
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into  main  memory  wnere  it  can  ce  executed.  Tnis  is  tne 
second  function  of  a  leader.  The  loader  must  first  determine 
where  eacn  object  will  he  placed  in  the  adcress  space  of  tne 
process  ^viz.,  a  program  in  execution).  (This  traditional 
concept  views  the  address  space  as  a  linear  array  of  memory 
locations.)  After  loading,  the  linking  loader  would  link 
distinct  objects  into  a  single  program  by  resolving  tne 
addressing  of  data  and  procedures  defined  external  to 
individual  subroutines.  (It  is  noted  tnat  some  reverse  tne 
order  by  linkinging  loaders  may  link  before  loading). 

3.  DYNAMIC  LINKING 

i 

The  alternative  to  the  static  linking  phase  of  the 
linking  loader  is  to  dynamically  link  separate  cvjects  at 
run  time.  This  involves  objects  referred  to  in  the  source 
code  of  a  program  by  a  symbolic  name  only.  The  corpl°te 
operation  (including  a  dyna'mic  linking  phase)  dictates  tnat 
the  object  be  located,  and  added  to  the  address  space  cf  a 
progran  (i.e.f  assigned  a  virtual  address'.  Then  tne 
reference  to  an  object's  symbolic  name  is  converted  into  an 
addressing  instruction  using  the  object's  virtual  adcress. 
This  implies  that  a  subroutine  as  it  exist  at  the  beginning 
of  run  time  cannot  properly  execute  since  the  object  code 
produced  from  a  reference  to  a  symbolic  name  must  be 
converted  into  a  virtual  address  in  the  address  space  of  a 
process.  This  address  conversion  is  known  as  dynamic 
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linking. 

In  order  to  support  dynamic  linking  a  system  must  nave 
the  ability  to  enter  objects  in  the  address  space  of  a 
process  during  run  time.  Additionally,  the  operating  system 
must  be  able  to  'load'  an  object  into  memory  during  program 
execution.  As  has  been  noted  these  two  functions 
traditionally  have  been  considered  operations  associated 
with  the  loader.  However,  it  should  be  apparert  that  this 
'loading'  is  actually  a  function  of  dynamic  memory 
allocation  using  techniques  such  as  pacing,  segmentation,  or 
dynamic  relocation.  Thus  in  a  dynamic  linking  environment 
the  loader  functions  ara  carried  out  by  the  operating  system 
memory  management  that  enters  objects  in  a  process  address 
space. 

C.  OPERATING  S YSTSy  ENVIRONMENT  FOR  A  DYNAMIC  LINKS?. 

1 .  The  Logical  levels  of  an  Operating  System 

It  is  useful  at  this  time  to  propose  an  abstract 
operating  system  as  an  environment  in  wnich  a  dynamic  linker 
will  exist.  This  operating  system  consist  of  four 
hierarchical  levels.  (An  operating  system  design  along  these 
lines  has  been  shown  feasible  for  microcomputers  [Kj.':  The 
most  fundamental  level  consist  of  the  hardware  associated 
with  the  target  machine.  Above  this  levels  is  a  software 
kernel  that  includes  the  most  basic  software  primitives 
including  memory  management,  file  primitives,  and 
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multiprocessing  support.  Conceptually,  the  kernel  includes 
those  software  routines  which,  in  a  secure  ope  rati  nr'  svstem, 
must  be  protected  from  malicious  or  inadvertant  tampering. 
In  a  multiprogramming  environment,  the  kernel  provides  tne 
capability  to  multiplex  resources  'i.e.,  line  printers,  disk 
units,  etc.)  for  various  us°r  processes. 

The  level  above  the  kernel,  the  supervisor  lev»l, 
consist  of  tnose  operating  system  routines  wnicr.  need  ret 
exist  in  the  kernel.  In  general,  the  supervisor  provides 
common  services  to  all  users.  The  final  level  is  the  user 
level  where  user  programs  and  data  reside.  (It  r.as  been 
shown  (by  Jansen  [7])  that  the  linker  should  be  able  to 
reside  in  all  user  levels.  Jansen  [7]  also  demonstrated  that 
the  dynamic  linker  need  not  and,  more  importantly,  should 
not  exist  in  tne  kernel.) 
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2 .  An  Introduction  to  the  Address  ?oace  y^££gr 

Fefore  an  object  ran  be  linked,  it  rust  te 

I 

addressable  by  a  process.  In  a  static  envircrrent ,  this 
would  equate  to  loading  tne  object  in  the  address  space  of  a 
process  by  allocating  to  it  a  linear  Meek  of  m^mor y. 
Essentially  this  is  what  is  done  in  a  dynamic  environment 

eicept  the  object  retains  its  identity  as  a  distinct  segment 

2 

and  is  allocated  a  virtual  address  .  (In  this  thesis, 
virtual  addresses  will  be  considered  to  consist  of  a  segment 
number  plus  some  offset  from  the  base  of  tnat  segment.)  ^he 
assignment  of  a  virtual  address  to  an  object  will  be  done  by 
tne  address  space  manager. 

The  address  space  manager  is  invoked  by  the  dynamic 
linker  with  a  request  to  make  an  object  known,  ""ne  address 
space  manager  does  this  by  assigning  to  the  object  a  unique 
identifier,  such  a?  a  segment  number,  that  can  be  used  to 
access  the  object  within  the  process  address  space.  An  entry 
for  the  object  will  then  be  made  by  the  address  space 
manager  in  a  table  to  prevent  assigning  multiple  identifiers 
to  the  same  object.  This  implies  that  a  search  would  first 
be  rade  of  this  table,  which  is  "ailed  the  Frocess  Reference 

2 

A  virtual  address  is  a  potentially  relocatable  address 
which  ray  be  converted  into  an  absolute  address  by  hardware. 
It  may  consist  of  a  segment  number  and  offset,  or  some  other 
relative  format  in  which  tne  bas°  address  of  tee  segment  is 
added  to  an  offset  to  achieve  the  absolute  address.  (However 
this  does  not  imply  that  segmentation  hardware  is  necessary 
in  a  dynamic  linking  environment.) 
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tauiis? 


Table  ^  ,  to  determine  if  tne  object  is  already  known.  If 
not,  the  address  space  manager  would  nave  tne  object 
assigned  a  segment  number  (identifier),  create  an  entry  for 
tne  object  in  the  process  reference  table,  and  return  tnis 
segment  number  to  the  linker. 

L .  terminology 

In  order  to  ensure  that  the  terminology  used  is 
understood,  the  following  definitions  are  offered. 

A  subroutine  will  be  defined  as  a  basic  unit  of 
standalone,  executable  code  (i.e.,  a  procedure).  Several 
subroutines  and  data  objects  can  be  combined  to  form  a 
program.  Stated  another  way,  a  progran  consist  of  all 
subroutines  and  data  modules  utilized  by  that  program  during 
its  execution.  A  process  [l]  is  a  program  in  execution  and 
is  characterized  by  an  execution  point  (usually  defined  by  a 
hardware  program  counter)  and  an  address  space.  Curing 
execution,  a  subroutine  may  call  an  external  object  that  is 
known  to  that  subroutine  only  by  its  symbolic  r.are  prior  to 
execution.  The  reference  to  an  external  object  within  a 
subroutine  will  be  called  an  external  reference  [15].  An 
external  object  [4j  nay  consist  of  either  data  (external 
data)  or  an  external  procedure  (that  is  itself  a 
subroutine).  Sacc  object  is  a  distinct  logical  entity  and 

3  in  Multics  [113 ,  the  process  reference  table  is  called 
the  known  segment  table. 
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will  at  times  also  be  referred  to  as  a  segment  [14].  (An 
effort  is  made  to  use  the  term  "object"  whenever  possible  to 
avoid  the  implication  that  a  processor  featuring  hardware 
segmentation  is  necessary  in  a  dynamic  linking  environment.) 


A 


\ 
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III.  THE  LINKING  PROCESS :  AN  OVEP.VIE'* 


Before  detailing  ttie  dynamic  licking  process,  a  brief 

walktnrougn  of  the  steps  involved  in  establishing  a  link 

between  the  subroutine  dCallerb  and  some  external  procedure 

<Target ! Entry_Name>  will  be  investigated.  fEntry_Name 

represents  one  of  multiple  accesses,  or  entry  points,  into 

<Tarmet>.  An  entry  point  into  an  object  can  be  corsidered  a 

label  that  can  be  referenced  by  an  external  object. 

Associated  with  each  entry  point  is  a  unique  entry  name,  and 

an  entry  point  offset  that  represents  the  relative  offset  of 

h 

tne  entry  point  from  the  starting  location  of  the  object.  ) 
Fundamentally,  the  following  events  must  occur  to  link 
<Target ! En try_Name>  to  <Caller>.  The  linker  must  be  invoked 
when  a  reference  to  <Target !  Entry_Name">  is  first 
encountered.  The  linker  must  be  capable  of  accessing  the 
symbolic  name  "Targe t !  Entry_Name"  ant*  usirg  that  symbolic 
name  to  learn  the  segment  number  of  <Target'.  Th»  linker 
will  then  establish  a  Itrk  to  <Taree t ! Sr try_Nare>  such  that 
subsequent  references  found  in  <Caller>  will  not  require 
invocation  of  the  linker  but  instead  will  result  in  either  a 
call  to  <Target ! Ent ry_Name> ,  in  the  case  of  an  external 

4 

The  term  entry  point  has  evolved  as  r°present ing 
either  the  label  'entry  point'  or  the  offset  associated  with 
that  label  [11,  14j .  This  convention  will  be  contirved  in 

this  tnesis  and,  where  the  possibility  of  ambiguity  exist,  a 
comment  will  be  made  to  ensure  clarity. 
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procedure,  or  a  memory  reference  to  the  virtual  address  of 
some  external  data. 

A.  THE  WALKTHROUGH 

When  tne  translator  encounters  an  external  reference  in 
the  source  code  of  ^Caller),  it  will  enter  the  symbolic  name 
"Target ! Entry _Wame"  in  the  symbolic  name  table  for  ^Caller^. 
(The  syrbolic  name  table  of  <Caller>  contains  the  symbolic 
names  of  external  references  and  data  associated  with  each 
entry  point  found  in  <Caller>.  Additionally,  the  symbolic 
name  table  exists  at  run  time.)  The  object  code  produced  for 
the  external  reference  to  <Target  I  Entry_Namel  (es  found  in 
<Caller>)  consist  cf  a  procedure  call  to  a  virtual  address 
in  <Caller>'s  linkage  table ^  .  (This  virtual  address  is 
constructed  at  run  time  using  a  base  register,  called  the 
linkage  pointer,  and  some  offset  into  Caller. link  generated 

4 

by  the  translator. )  The  virtual  address  called  is  an  entry 
in  Caller. link  set  aside  for  <Ta rget ! En try_Name>  an-*  will  be 
referred  to  as  an  outgoing  link.  The  outgoing  link  has  teen 
initialized  to  invoke  the  linker  and  pass  to  the  linker  the 
offset  (in  Caller. sym)  of  the  symbolic  name 
"Target i Entry_Name" .  The  linker  uses  this  offset  alon?  kith 

^  The  symbolic  name  table  of  an  object  will  be  called 
object. sym,  while  object. link  will  refer  to  an  object's 
linkage  table.  Thus  <'Caller>'s  symbolic  name  table  and 
linkage  table  become  Caller. sym  and  Caller. link 
respectively. 
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the  virtual  address  of  tne  base  of  Caller. sym  (vai ch  is 
stored  in  Caller. link) ,  to  access  the  symbolic  name  of  the 
external  reference.  Once  located,  the  linger  will  pass  the 
symbolic  name  "Target"  to  the  address  space  manager. 

"he  address  space  manager  first  determines  if  an  entry 
for  <Target>  already  exist  in  the  process  reference  table. 
If  r.ot,  the  address  space  manager  will  locate  the  object 
<Target>  and  have  it  assigned  a  segment  number  in  the 
address  space  of  the  executing  process.  It  will  also  make  ar. 
entry  for  <Target>  in  the  process  reference  table  and  return 
to  the  linker  the  segment  number  of  <’Target>.  (It  is  at  this 
point  that  <Target>  is  'known'  to  tne  executing  process.) 

The  linker  now  knows  the  segment  number  of  <Tarmet>  and 
must  create  a  linkage  table  for  ^Target'*  (if  ore  has  not 
already  been  constructed  by  an  external  reference  to 
<7arget>  within  another  subroutine).  A  template  accessablp 
to  the  linker  nas  been  constructed  by  the  translator  for 
this  purpose  and  is  appended  (after  minor  computations)  to 
the  end  of  the  combined  linkage  table  ^  (as  Targe t . 1 ink ) . 
(The  building  of  a  linkage  table  for  <Target>  allows  it  to 
engage  in  dynamic  linking.)  Additionally,  the  starting 
address  of  Target. link  is  entered  in  a /data  structure  known 

^  Th®  combined  linkage  table  contains  the  linkage  tables 
of  each  object  in  a  process.  (Note  that  it  is  not  necessary 
to  utilize  a  combined  linkage  table  in  an  implementation 
since  each  object's  linkage  table  could  be  allocated  its  own 
segment . ) 
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ds  tne  Linkage  Address  Table,  making  it  available  for  future 
linking  evaluations.  (The  lintcage  address  table  of  a  process 
can  be  considered  an  array  containing  the  base  address  of 
each  object's  linkage  table  and  is  subscripted  by  tne 
object's  segment  number.  > 

A  complete  virtual  address  for  ^Ta  rget !  Kntry_Nan,e>  car 
be  constructed  by  searching  Target. syrr  for  "?nt ry_Name"  to 
discover  the  entry  point  (offset)  and  incoming  link  offset 
associated  with  ”Entry_Name" .  (An  incoming  link  is  a  section 
of  an  object's  linkage  table  set  aside  to  allow  the 
performance  of  housekeeping  functions  prior  to  invoking  the 
object . ) 

The  linker  will  now  alter  the  outgoing  link  (in 
Caller. link)  to  jump  to  the  incoinin g  link  .'found  in 
Targe t . link) .  The  linker  then  constructs  the  incoming  link 
to  jump  to  the  virtual  address  of  <Target ! Entry_Name>  after 
setting  the  linkage  pointer  te  point  to  Target. link.  (Tne 
linkage  pointer  is  a  global  pointer,  e.g.  hardware  register, 
which  always  points  to  the  currently  executing  subroutine's 


linkage 

table.  Thus 

before 

execution 

in 

dTargeO  can 

commence , 

the  linkage 

pointer 

must  be 

set 

to  point  to 

Target. link.  mhe  reason  for  this  will  be  discussed  later.) 
After  the  outgoing  and  incoming  links  are  executed,  tne 
process  will  be  executing  in  <Target>. 

Vhen  <Target>  has  finished  it  will  execute  a  return 
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instruction.  Recall  that  the  only  procedure  call  in  the 
linkage  sequence  was  kCaller^’s  call  to  the  outpoint  link 
(in  Caller. link)  ensuring  a  return  to  <Caller>  after  the 
completion  of  <'Tar^et>.  The  final  step  is  to  reset  the 
linkage  pointer  to  the  virtual  (base)  address  of 
Caller. link.  (This  is  done  tv  the  translated  external 
reference  in  <Caller>.) 

The  steps  followed  for  linking  external  data  would  he 
similar  except  data  is  not  executed.  Therefore,  the  outgoing 
link  need  not  "invoke"  the  data  (via  the  incoming  link)  hut 
instead  must  allow  <Caller>  to  reference  the  data.  If 
indirect  addressing  is  available,  the  out^oinp  link  can  be  a 
storage  location  for  the  virtual  add  res s  of  the  external 
data  and  can  be  referenced  via  an  indirect  addressing 
instruction.  (Note  that  on  the  first  reference,  this 
indirect  addressing  instruction  must  be  able  to  invoke  tne 
linker  in  some  fashion.  In  Kultics,  this  is  done  by 
generating  a  fault  which  invokes  the  linker  as  the  fault 
handler.)  If  indirect  addressing  is  not  available  (or  cannot 
be  used  to  invoke  the  linker  on  first  reference  to  the 
data),  the  outgoing  link  can  contain  executable  instructions 
whicn  load  some  pointer  with  tne  virtual  address  of  the  data 
and  then  return  the  execution  point  to  dCallerb. 
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B.  A  SYNOPSIS  01  TFE  V AIKTHROOGh 

To  provide  the  reader  with  an  abreviated  review  cf  tne 
steps  to  snap  a  link- ,  tne  following  synopsis  is  ^rovi^ed. 
Additionally,  figure  1  is  annotated  with  the  number  of  each 
"step”  to  provide  added  clarity.  When  tne  executing 
procedure  (i.e.f  <Caller>)  encounters  a  translated  external 
reference  to  <Target ! Pn try_Name>  for  the  first  time,  the 
following  sequence  of  events  transpires: 


Step  1  -  The  execution  point  is  transferred  to  the 
outgoing  link  (in  Caller . 1 ink  ) . 

Step  2  -  The  linxer  is  invoked  by  the  initialized 
outgoing  link.  The  linker  is  passed  the  offset  cf 
<Target! 3ntry_Name>'s  entry  in  Caller.  sy~  as  an 
argument . 

Step  3  -  The  linker  references  Caller. svm  and  extracts 
the  symbolic  name  "Target i Entry_Name"  and  the  offset  (in 
Caller. link)  of  the  (appropriate)  outgoing  link  for 
<Target ! Entry_Name>. 

Step  4  -  The  linker  invokes  the  address  space  manager 
with  the  argument  "Target". 

Step  5  -  The  address  space  manager  enters  <Tarmet>  in 
the  process  address  space  (if  r.ecessarv)  and  returns  to 
the  linker  the  segment  number  of  <Target>. 

Step  6  -  The  linker  builds  a  linkage  table  for  <Tarret> 
(not  shown). 

Step  ?  -  The  linker  searches  Target. sym  for  "Entry_Niame ” 
and  extracts  the  offset  of  the  incoming  link  (for 
Entry_Name)  in  Target. link,  and  the  entry_poir.t 
associated  with  £ntry_Name. 

Step  3  -  The  linker  computes  the  virtual  address  in 
^Target)  associated  with  <Targe t ! ?n try_Nane>  and  the 
virtual  address  of  3ntry_Name's  incoming  link. 
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? tep  9  -  The  linker  establishes  the  lin*<  by  entering  a 
juinp  to  the  incoming  link  in  the  outgoing  link  (in 
Caller. link);  and  by  loading  the  incoming  link  (in 
Target. link)  with  an  instruction  which  loads  the  linkage 
pointer  with  the  address  of  Target. link  and  a  jump  to 
the  entry_point  in  <Target>. 

Step  1£  -  The  linker  invokes  <Target>  at  the 
entry_point. 


Figures  2  and  3  are  included  to  show  the  execution  ■ 
sequence  of  a  snapped  link  for  procedures  and  data 
respectively.  It  is  noted  that  a  link  that  has  already  been 
established  does  not  require  the  invocation  of  the  linker 
but  rather  directly  references  tne  external  object. 


SEQUENCE  OF  EVENTS  FOR  SUBSEQUENT  REFERENCES  TO  < TARGET I ENTRY_NAME> 


SEQUENCE  OF  EVENTS  FOR  SUBSEQUENT  REFERENCES  TO  <DATA^ENTRY_NAME> 

FIGURE  3 


IV.  THE  SPECIFICS  OF  EYNAMIC  LINKING 


A.  FUNCTIONS  OF  A  LINKER 

Dynamic  Uniting  centers  around  the  ability  to  alter 
impure  code  (linkage  tables7  )  during  run  time.  It  is  tnis 
feature  which  allows  invocation  of  the  linker  on  the  first 
reference  (to  an  object)  and  yet  permits  subsequent 
references  to  the  same  object  to  access  tnat  object  directly 
( i  -  e  . ,  without  invocation  of  the  linker).  Establishing,  or 
snapping  [111  ,  a  link  does  not  represent  all  the  functions 
desirable  in  a  linker.  Linkage  tables  must  be  constructed  on 
the  first  reference  (within  a  process)  to  an  object,  and 
system  limitations  may  subsequently  force  the  removal,  or 
unlinking,  of  an  object  from  a  process  address  space. 

1 .  Snapping  a  Link 

a.  Procedure  Links 

When  snapping  a  link  between  procedures,  the 
linker  will  initially  be  passed  the  offset  (in  Caller. sym) 
of  (the  entry  for)  the  symbolic  name  "Target ' Entry_Nare" . 
The  linker  can  find  •  Caller. syn  via  a  pointer  stored  in 
Caller. link.  (Pecall  tnat  the  1  inkage  point.e  r  always 
indicates  the  executing  procedure's  linkage  table  ensuring 
tre  linker  can  locate  Caller. link.  )  New  the  linker  knows  tne 

7 

It  should  be  noted  that  linkage  tables  avoid  the 
undesirable  effects  normally  associated  with  impure  cod<=>  by 
being  serially  reusable  and  a  per  process  entity  fi.e.,  one 
linkage  table  per  process  for  each  object). 
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symbolic  name  of  the  object  to  be  linked,  but  it  must 
determine  a  virtual  address  within  the  object  to  be 
referenced . 

In  order  to  make  <Target !Fntry_Name> 
addressable,  the  linker  must  determine  tne  segment  number 
associated  with  <Target>,  and  the  entry_poir.t  associated 
with  Entry_Name.  To  determine  the  segment  number  of 
<Target>,  the  linker  will  invoke  the  address  soace  manager 
passing?  the  symbolic  name  "Target"  as  an  argument.  The 
address  space  manager  will  enter  <Target>  in  the  process 
address  space  (if  it  is  not  already)  and  return  <Target>'s 
segment  number  to  the  linker  Obtaining  the  segment  number  is 
trivial  since  the  address  space  manager  will  return  this 
information  to  the  linker  when  passed  the  symbolic  name 

t*  „  i» 

Target  . 

Finding  the  entry_point  associated  with 
Entry_Name  requires  access  to  Target. sym.  As  will  be 
discussed,  a  second  function  of  the  linker  is  to  construct  a 
linkage  table  for  <Target>  (If  one  does  not  already  exist  as 
a  result  of  some  previous  reference  to  <’Target>!.  After 
Target. link  has  been  constructed,  to  find  Target. sym,  the 
segment  number  of  ^Target"*  is  first  used  to  access  (in  the 
linkage  address  table)  the  virtual  address  of  Target. link. 
(Recall  that  the  linkage  address  table  is  an  array  of 
pointers  to  the  linkage  table  of  eacn  object  in  a  process 
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address  space.)  A  pointer  is  found  in  Targpt.llnk  to 
Target .sym. 

It  is  proposed  that,  in  an  environment  allowing 
multiple  entry  points  into  an  object,  each  distinct  entry 

name  into  an  object  he  stored  in  the  object's  symbolic  name 
table.  In  addition,  the  entry  point  (viz.,  the  offset  into 
the  object)  and  the  offset  (in  object. link)  of  the  incoming 
link  associated  with  each  entry  point  will  also  be  stored  in 
object. sym.  Thus,  by  searching  Target. sym  with  the  argument 


Entry_Name  ,  the  linker  can  compute 

the 

entry_point 

and 

incoming  link  address 

necessary 

to 

snap  a 

link 

to 

<?arget ! Sn  try_Mame> . 

The  first  step 

in  the  actual 

snappi ng 

of 

the 

link  is  to  alter  the  outgoing  link  (in  Caller. link)  from  a 
jump  to  the  linker  to  a  jump  to  the  incoming-  link  (in 
Target. lirk).  The  address  jumped  to  is  formed  by  combining 
the  seg-ment  number  of  Target. link  (which  is  found  in  the 
linkage  address  table)  with  the  offset  (as  stored  in 
Target. sym)  of  the  incoming  link. 

The  second  step  is  the  building  of  the  incoming 
link.  The  incoming  link  consist  of  two  instructions.  The 
first  loads  the  linkage  pointer  (Lp)  with  the  virtual 
address  of  Target. link  ensuring  that  the  linkage  pointer 
always  points  to  the  currently  executing  procedure's  linkage 
table.  This  is  necessary  to  allow  a  procedure's  translated 
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code  (viz.,  object  code  segment)  to  reference  an  external 
object  while  remaining  pure.  A  reference  to  an  external 
object  is  achieved  via  the  outgoing  link;  the  virtual 
address  of  the  outgoing  link  is  computable  at  run  tire  by 
adding  a  fixed  (at  translation  time)  offset  to  tne  linkage 
pointer  and  allowing  the  linkage  pointer  to  vary  during 
execution  (see  figure  4).  Stated  another  way,  it.  is  tne 
linkage  pointer  which  allows  (pure)  translated  code  to  Jump 
to  an  entity  (the  outgoing  link)  which  is  not  bound  to  a 
virtual  address  until  run  time. 

The  second  instruction  in  the  incoming  link  is  a 
jump  to  the  virtual  address  of  (Target ! Intry_Name>  (of  the 
form  <segment_number !  entry_point>) .  Note  that  the  incoming 
link  may  already  exist  in  its  snapped  form  as  a  result  cf 
some  previous  reference  to  (Target ! Entry_Name' .  To  identify 
this  condition,  the  linker  will  first  check  a  'snapped  link 
bit'  which  is  set  if  the  incoming  link  is  snapped.  A  snapped 
link  is  shown  in  figure  5. 

One  may  observe  that  the  outgoing  end  incoming 
links  could  be  merged  into  one  link  consisting  of  a  load 
linkage  pointer  instruction  followed  by  a  jump  to 
(Target i En try_Name> .  This  change  eliminates  incoming  links 
but  effectively  requires  an  'incomi ng-type '  link  to  be 
constructed  in  each  outgoing  link  referencing  an  object. 
This  approach  was  not  choser.  since  it  requires  the 
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SOURCE  CODS 


PFOCETURE  eiakpie; 

DECLARE  <Target>  PROCEDURE  E^TE^NAI 5 
/*  code  */ 

BEGIN  /*  example  */ 

CALL  <"Target !  Er.  try_NameN>; 

I 

END;  /*  of  example  */ 


OBJECT  CODE 


/*  bepin  example  */ 


CALL  (Lp  +  offset  of  <Ta r&et  ! Ent ry_Name> 's  outgoirg  link! 


/*  end  example  */ 


TPANSIATED  EXTERNAL  REFERENCE 
FIGURE  4 
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construction  of  an  'incoming  link'  for  eacn  reference  to  a 


procedure  (vice 

just 

one 

incoming 

link)  and 

addi ticnally 

results 

in  multiple 

load 

linkage 

pointer 

instructions. 

( Note , 

ho  wever. 

tha  t 

i  n 

either  of 

these 

link  formats. 

subsequent  references  to  <Target !  Entry_Name>  do  cot  result 
in  invocation  of  the  linker.) 
b.  Data  links 

For  external  data,  the  steps  to  snap  a  link  are 
similar  except  the  linker  alters  the  outgoing  link  to  an 
instruction  which  loads  a  pointer  (preferably  a  register) 
with  the  virtual  address  of  tne  external  data,  and  a  return 

Q 

instruction  .  (As  will  be  shown,  it  is  necessary  for  data 
segments  to  have  both  linkage  tables  and  symbolic  name 
tables.  This  permits  the  linker  to  use  essentially  one 
algorithm  to  dynamically  link  botn  data  and  procedure.! 
Thus  any  subsequent  references  tc  the  external  data 
( <Da ta ! Ent ry_Name> )  initiated  by  <Caller>  would  result  in 
loading  a  pointer  with  the  virtual  address  of 
<Data  !  5n try_Name>  followed  by  a  return  to  'Callerb  and  wculi 
not  result  in  additional  linker  calls. 

As  with  procedures,  it  is  desirable  to  rpf°r°no® 
multiple,  symbolically  named  locations  (vi:.,  entry  points' 

R 

As  has  been  previously  discussed,  it  is  necessary  to  use 
this  form  of  outgoing  link  if  the  processor  hardware  cannot 
support  an  indirect  addressing  instruction  to  invoK*=  tne 
linker  (on  first  reference)  and  subsequently  access  the 
virtual  address  of  the  external  data. 
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in  a  data  structure.  This  implies  that  ^Latab  must  undergo  a 
translation  to  identify  entry  names  and  entry  points  and 
furthermore,  must  have  a  symbolic  name  table  in  which  this 
information  is  stored.  It  is  also  necessary,  given  tnis 
condition,  that  <Data>  nave  a  simplified  linkage  table 
consisting  of  a  linkage  table  header.  (The  contents  of  a 
linkage  table  header  will  be  presented  later./ 

c.  Construction  of  a  list  of  Snapped  Links 

For  each  object  in  a  process  address  space,  it 
may  be  desirable  for  the  linker  to  construct  a  linked  list 
which  contains  a  pointer  to  each  snapped  ou  t  go i ng  link 
referencing  that  object.  Tnis  linked  list  is  basically  used 
to  provide  a  record  of  references  to  an  object  tc  permit 
unlinking  an  object  from  an  address  space.  (Unlinking  will 
be  discussed  in  more  detail  later.)  A  pointer  to  the  start 
of  this  linked  list  would  be  stored  in  the  header  of  the 
object's  linkage  table  and  new  entries  to  the  linked  list 
would  be  entered  at  the  head  of  the  list  (when  snapping  an 
outgoing  link).  The  linked  list  could  easily  be  implemented 
by  storing  a  list  pointer  in  each  snapped  outgoing*  link. 

2.  Building  Linkage  Tables 

Before  <Tar»et>  can  commerce  execution,  it  must  have 
a  linkage  table  in  which  snapped  links  car  be  stored.  This 
allows  ^Ta  rge t>  to  engage  i  r.  iynam  i  c  linking  (if  it  is  a 
procedure).  There  exist  two  circumstances  under  wr.ich  tr.e 
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linkage  table  trust  be  built.  The  first,  and  obvious, 
situation  is  when  an  external  object  is  dynamically  linked 
to  a  process.  The  second  is  when  a  program  is  iritially 
started  executing  (viz.,  during  process  initialization). 

However  the  steps  involved  in  these  two  cases  do  not  differ, 
allowing  the  same  module  of  the  linker  to  be  utilized  in 
both  instances. 

To  build  a  linkage  table,  the  linker  will  access  a 
template  for  an  external  object  (or  program)  that  was 
constructed  during  translation.  The  template  is  an  exact 
duplicate  cf  object. link  with  tne  exception  of  the  symbolic 
name  table  virtual  address.  The  lirker  must  therefore  only 
add  the  segment  number  of  'Target>  to  the  symbolic  name 
table  offset  as  found  in  the  template  to  obtain  a  complete 
virtual  address  (for  Target.sym).  (This  approach  assumes 
Target. sym  is  a  part  of  the  translated  code  of  fTargetb.) 

The  remainder  of  the  template  is  then  appended  to  tne 

9 

combined  linkage  table  .  An  example  cf  an  Initialized 
linkage  table  (and  thus  a  template)  is  given  in  figure  C. 

"here  are  two  problems  relat®i  to  the  implem^r.ta  t  icr. 
of  this  linker  function  wnich  require  discussion.  The  first 

9 

It  is  not  necessary  for  an  implementation  to  include  the 
combined  linkage  table  sir.re  individual  linkage  tables 
be  assigned  unique  segment  numbers.  In  fact,  in  a 
multidomain  environment  [S] ,  it  is  desired  to  assign  linkage 
tables  to  separate  segments  since  tnis  permits  the  dynamic 
linker  to  be  domain  independent  (in  accordance  with  the 
design  criteria  of  Table  1). 
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FIGURE  5 
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question  involves  where  the  template  is  located  in  c  process 
address  space.  One  does  net,  ;.n  general,  want  the  template 
to  be  a  part  of  tne  object  code  since  this  will  result  in  an 
entity  (the  template'  which  is  used  only  once  becoming  an 
extraneous  part  of  a  process,  (.’vote  that  system  limitations 
may  force  this  shortcoming  on  an  implementation.’'  A  solution 
in  a  non -segmented  system  is  to  ma’-re  the  template  a  separate 
file.  (One  may  not  wish  to  do  this  in  a  segmented  system  if 
the  number  of  segments  represents  a  limited  asset  and  a  file 
corresponds  to  a  segment  since  this  would  require  assigning 
the  template  its  own  segment  number.)  However,  in  a  derar.d 
paging  environment,  the  template  can  be  a  part  of  tne  object 
code  since  it  will  only  reside  in  memory  when  required  and 
will  then  be  'paged'  out.  Because  it  will  never  again  be 
referenced,  the  template  will  never  again  be  loaded  into 
memory. 


/ 
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This  leads  to  the  second  pro  tier*  of  ensuring  the 
linker  can  find  ta?  template  when  it  is  a  part  of  tne  object 
code.  There  are  several  solutions  to  this,  the  most  «imple 
of  which  is  to  place  a  pointer  to  the  template  at  seme  known 

location  in  the  object  code.  4nother  solution  would  entail 
making  the  template  a  separate  file.  Thus  when  building  a 
linkage  table,  the  template  is  brought  into  a  process 
address  space,  copied  into  the  combined  linkage  table,  and 
then  deleted  from  tne  process  address  space. 

?.  Unsnapping  Objects 

It  may  be  necessary  to  remove  an  object  frem  tne 
address  space  of  a  program.  This  situation  ray  occur,  for 
example,  when  using  the  7.SU22  processor  [12]  witn  cr.e  memory 
management  unit  (MVU).  Since  this  hardware  configuration 
allows  a  maximum  of  64  segments  (some  of  which  will  be 
allocated  to  the  operating  system),  it  is  entirely  possible 
that  a  process  may  require  in  excess  of  the  maximum  number 
of  available  segments.  It  is  desirable  then  to  be  able  to 
remove  an  object  from  the  process  address  Space  and  unsnap 
all  outgoing  links  referring  to  that  object. 

The  unsnapping  of  an  outgoing  link  is  a  simple 
procedure.  The  snapped,  outgoing  link  is  merely  replaced  by 
an  entry  equivalent  to  tne  original,  unsnapped  outgoing 
link.  More  specifically,  tnis  unsnapped  link  consist  of  code 
to  pass  the  linker  the  offset  of  the  ext°rral  object's 


41 


symbolic  nape  in  object. sym  followed  by  the  invocation  of 
the  linker.  (For  simplicity,  it  will  be  assumed  that  the 
linker  is  invoked  via  a  jump  instruction.)  This  implies  that 
a  portion  of  each  snapped  link  must  be  set  aside  to  store 
the  offset  of  the  symbolic  name  for  use  during 
unlinking10 . 

The  first  step  in  the  unlinking  process  occurs  when 
the  address  space  manager,  after  being  requested  by  the 
linker  to  add  an  object  to  a  process  address  space,  returns 
a  message  to  tne  linker  indicating  no  segment  numbers  are 
available  (if  this  is  the  case).  The  linker  would  then  cause 
a  segment  to  be  deallocated.  ' 

If  desired,  tne  object's  linkage  table 
( object . link) ,  can  be  deleted  from  tne  combined  linkage 
table  by  performing  a  compaction  on  the  combined  linkage 
table.  (Note  that  compaction  is  not  necessary  since,  aside 
from  resulting  in  unused  memory  in  the  combined  linkage 
table,  if  the  deleted  segment  is  centered  in  the  process 
address  space,  a  new  linkage  table  will  be  built  and 
appended  to  the  combined  linkage  table.)  If  a  compaction  is 


Note  that  all  information  necessary  tc  reset  the  link 
(thus  deleting  the  requirement  to  store  tne  offset  ir.  tr.e 
linkage  table)  is  available  in  tne  combined  linkage  table, 
the  subroutine  offset  table  and  the  template.  However,  the 
steps  necessary  to  extract  this  data  are  rather  involved  and 
the  alternative  of  savin?  the  offset  within  a  snapped  link 
is  su??ested  unless  infrequent  unlinkiu?  evolutions  are 
expected . 
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done,  tiie  deleted  linkage  table  contains  tr.reads  ir.  tne 
linked  list  of  other  segments,  whicn  must  te  removed  without 
destroying  the  linked  list  they  were  a  part  of.  Ore  solution 
to  this  problem  is  to  implement  a  doubly  linked  or  circular 

linked  list  (by  having  the  last  entry  of  the  list  point  tc 
tne  linkage  address  tatle  instead  of  being  set  to  nil).  Now, 
prior  to  removing  object. link,  tne  linker  could  find  and 
adjust  each  thread  (of  a  linked  list)  with  a  node  in 
object. link  ensuring  the  integrity  of  other  segments'  linked 
list . 

Compaction  presents  two  other  pro'blerrs.  First,  when 
object. link  is  removed,  other  subroutines'  linkage  tables 
may  be  relocated  within  the  combined  linkage  table  thus 
receiving  new  virtual  addresses.  This  requires  that  the 
linkage  address  table  values  for  those  linkage  tables  along 
with  linked  list  threads  pointing  into  them  to  be  adjusted 
accordingly.  The  correction  must  be  dene  prior  to  actually 
compacting  (because  linked  list  threads  in  the  deleted 
linkage  table  will  be  lest  during  compaction'  and  reiuires 
tnat  addresses  in  the  combined  linkage  table  'i.e., 
subroutine  offset  table,  linked  list,  and  snapped  link 
addresses)  be  corrected  by  the  size  of  the  reroved  linkage 
table.  A  second  problem  relates  to  snapped  outgoing  links 
which  jump  to  incomirg  links  in  relocated  linkage.  T:ocse 
must  also  te  Adjusted  by  the  size  of  object. link.  Note  that 
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a  subroutine's  linked  list  identifies  each  outgoing  link 
that  jumps  into  its  linkage  table.  Therefore,  every 
procedure  segment  whose  linkage  address  table  value  requires 
correcting  trust  have  each  entry  in  its  linked  list  updated. 

When  unsnapping,  tne  linked  list  (constructed  the 
linker)  is  traversed  and  each  entry  in  the  list  is 
reinitialized.  Note  that  unlinking  affects  rrar.y  subroutine 
linkage  tables  yet  the  linkage  pointer  still  points  to 
object. link  for  the  subroutine  which  originally  invoked  the 
linker.  This  implies  that  linked  list  pointers  must  either 
be  complete  virtual  addresses  or  relative  to  the  start  of 
the  combined  linkage  table  (i.e.,  they  cannot  be  relative  to 
the  linkage  pointer. ) 

An  alternative  to  a  linked  list  implementation  is  to 
have  the  linker  search  the  combined  linkage  table  for  all 
snapped  outgoing  links  referencing  the  deleted  segment  and 
-reset  each  one  found.  (This  is  a  less  general  solution  since 
it  requires  the  linker  to  know  the  format  of  all  possible 
linkage  table  entries  in  order  to  identify  those  wr.ich  must 
be  reset.)  Once  all  linkage  table  entries  have  been 
reinitialized,  the  object's  linkage  address  table  entry  is 
set  to  nil,  and  the  object  and  its  linkage  table  (if 
desired)  are  removed  from  tne  process  address  space. 
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3.  OPERATING  SYSTEM  SUPPORT 

1 .  Tne  Address  Space  Manager 

As  has  been  noted,  before  a  link  to  an  object  ran  be 
snapped,  the  object  must  first  be  entered  in  the  address 

space  of  a  process.  A  request  to  enter  an  object  (i.e.,  make 
it  known)  is  forwarded  from  tne  linker  to  the  address  space 
manager.  The  address  space  manager  will  be  passed  the 
symbolic  name  of  the  object  tnat  is  to  be  made  accessable 
and  will  first  search  each  entry  in  the  process  reference 
table  to  determine  if  an  entry  already  pxist  fcr  the  object. 
If  so,  it  will  return  to  tne  linker  tne  segment  ranter  of 
the  object. 

If  the  object  is  not  accessable,  the  address  space 
manager  must  first  call  on  File  Management  to  locate  the 
object.  After  the  object  is  located,  Memory  Management  is 
invoked  to  assign  a  segment  number  to  tr.e  object  11  .  If 
Memory  management  were  to  indicate  that  it  had  no  segment 
numbers  left  to  assign,  tn°  address  space  manager  wculi 
return  to  the  linker  a  message  to  this  effect. 

11  It  is  realized  that  this  represents  a  very  vaeue 
description  of  how  an  object  is  located  and  assigned  a 
segment  number.  However,  since  the  exact  steps  involved  are 
highly  dependent  on  the  operating  environment  and  are 
fundamental  to  most  mul ti programming  systems,  it  is  felt 
that  adequate  information  exist  elsewhere  to  allow 
implementation  of  these  functions  without  discuesinm  them  in 
this  thesis.  Note  that  the  file  system  in  use  may  be 
extremely  sophisticated  as  in  Multics  fn],  or  represent  a 
simple  one-to-one  maoping  of  symbolic  names  to  corresponding 
files. 


45 


2. 


The  Process  Reference  Table 


The  process  reference  table  contains  an  entry  for 
each  object  in  the  address  space  of  a  process.  The  fcrrrat 
for  an  entry  (figure  7)  includes  the  symbolic  name  of  the 
object  alone  with  the  segment  number  of  the  object.  A  third 
item  which  may  be  found  in  the  process  reference  table  is  a 
removal  status  reflecting  the  priority  of  an  object  for 
removal  when  unlinking. 

Note  that  unlike  a  symbolic  name  table  entry,  the 
symbolic  name  found  in  the  process  reference  table  does  not 
include  entry  names.  For  example,  a  process  may  contain 
external  references  to  <Tarfeet ' Entry_N'ame_l>  and 
<Tar?et | Fn try_Name_2> ,  but  the  process  reference  table  would 
only  contain  one  entry  for  <Target>. 


i  symbolic  :  segment  :  removal  ! 
i  name  :  number  :  priority  ' 


Figure  7  -  Process  Reference  Table  Entry 


3.  Object  Deletion  from  a  Process  Address  fpace 

In  conjunction  with  the  linker,  a  module  of  the 
operating  system  must  exist  to  delete  an  object  from  the 
address  space  of  a  process.  When  invoked  by  the  linker,  tnis 
module  would  use  some  policy,  such  as  least  recently  used  or 


first-in,  first-out,  to  select  an  object  for  removal.  The 
module  would  notify  Memory  Management  that  the  object's 
segment  number  is  no  longer  in  use  ard  reset  the  object's 
entry  in  the  process  reference  table  to  nil.  The  module 

would  then  inform  the  linker  of  the  segment  number  of  the 
deleted  object.  The  linger  can  now  unsnap  links  to  the 
object . 

It  is  useful  to  point  out  policy  considerations  for 
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is  called  to  look 
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referenced  object.  It  may,  therefore,  be  advantageous  to 
keep  track  of  the  number  of  links  to  an  object  to  avoid 
removal  of  a  segment  which  is  referenced  mary  times.  (One 
should  not,  however,  strictly  delete  the  object  referenced 
the  least  number  of  times  since  this  may  well  be  the  last 
object  entered  in  the  address  space  and,  applying  the 
principal  of  locality,  be  subject  to  further  usp  in  the  rear 
future. ) 

i 

Another  important  item  to  b<=  considered  before 
selecting  a  subroutine  for  removal  is  whether  it  will 
eventually  be  returned  to  by  the  currently  executing 
procedure  (i.e.t  it  has  a  current  activation  record^.  As  an 
example,  say  procedure  A  called  procedure  B  which  call°d 
procedure  C.  But  before  C  could  be  linked  an  unlinking 
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evolution  was  required.  Certainly  one  would  rot  wart  to 
remove  A  or  E  to  make  virtual  memory  available  for  C  siu^e 
these  two  procedures  wculd  bp  returned  tc  when  C  completed 
executing  and  the  linking  process  has  only  been  defined 
during  a  procedure  call.  Thus,  if  A  cr  E  were  ur.lir>ed,  C 
would  return  to  a  non-existant  module  which  it  cculd  not 
link  to  or  access  (since  A  or  I  would  no  longer  be  in  me 
process  address  space.) 

If  the  information  necessary  to  determine  whether  a 
procedure  has  a  current  activation  record  is  not  readily 
available,  there  is  an  easily  implementable  mechanism  for 
determining  this.  A  counter  can  be  assigned  to  earn 
procedure  (in  a  process  address  spaced  that  would  be 
incremented  or  decremented  as  the  procedure  is  invoked  or 
completes  execution.  Thus,  a  procedure  wnose  counter  is  zero 
has  no  current  activation  records  and  is  available  for 
removal.  The  counter  could  be  updat-ed  by  code  in  the  snapped 
link  and  could  be  located  in  a  procedure's  linkage  table  or 
linkage  address  table  entry.  Tnis  implies  that  the  linker 
must  be  involved  in  the  selection  of  an  object  for  removal. 

4.  Process  Initialization 

Frccess  initialization  involves  those  functions 
vnich  must  be  carried  out  by  the  operating  system  prior  to 
commencement  of  program  execution.  A  brief  review  of  these 
functions  is  offered  at  tnis  time  with  a  more  detailed 
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discussion  available  ir.  work  by  Jansen  [7,  ej  . 

lefore  a  process  can  corin’, ence  execution  of  a 
program,  the  program's  linkage  table  (program  .link'  and 
linkage  address  table  must  be  allocated  a  settlor,  of  the 

process  address  space  and  both  tables  rust  be  initialised 
(or  built  from  a  template  in  the  case  of  program .  1  ink ) . 
Additionally  the  linkage  pointer  must  be  set  to  point  to 
program. 1  ink.  The  operatic?  system  must  initialize  the 
process  reference  table  with  the  applicable  data  for  the 
program  to  be  executed.  Once  this  is  accomplished,  ~alls  by 
<program>  can  be  dynamically  linked. 

C.  TF.ANS1ATOS  SUFPOST 

The  process  cf  dynamic  linkin?  is  only  practical  if  the 
translator,  whether  a  compiler  or  assembler,  has  been 
designed  to  support  dynamic  linking.  In  a  (translator) 
supported  system,  the  translator  must  be  ablp  to  identify 
external  references,  build  the  symbolic  name  tablp  and 
linkage  table  template,  and  identify  entry  points  and  entry 
names.  A  translator  will  be  assumed  to  produce  relocatable 
object  code  allowing  dynamic  relocatior  of  object  rode 
segments — either  by  relocation  hardware  or  software. 

"ogether,  the  translator  and  the  linker  must  meet  two 
requirements.  First,  the  object  code  must  remain  pure  during 
the  linking  process  to  allow  use  of  shared  prcreJure 
segments  in  a  multiprogramming  environment  (i.e.,  the  pure 
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object  code  criterion  o TiM*-  i  .  In  d  d  i  tier,  tne  '-ode 
produced  by  the  translator  aioi.r-  *:tr.  t.ne  steps  fcllo-ed  ir. 
th°  1  icici  zr  process  rest  not  lirit  f»  itu  res  of  the  sc  roe 
language  'i.e.,  tne  syntactic  competabi  li  t”  criterion). 

1.  7xternal  References 


A  translator  "'ust  be 
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t  -> 
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produced  by  the  translator  i^  tc  an  aiir-ss  wd-h  ■-'ar.  :e 
expressed  as  tne  value  o*“  the  linkage  rc  i  r  t c  r  plus  sore 
offset.  Since  the  translator  constructs  the  linkage  table 
template,  it  knows  the  relative  offset  for  a  sy-r^di'*  race's 
outf'Cine  link  in  the  linkage  table.  As  r.as  been  noted, 
because  the  linkage  pointer  identifies  tne  he-’i nr i ng  cf  tne 
executing  procedure's  linkage  table,  tne  object  cede  for  an 
external  reference  can  he  designed  tr  call  the  cutgcinr  link 
desired.  (The  use  of  the  linkage  pointer  ensures  tne  purity 
of  a  procedure's  trarslatei  code.) 

2 .  Symbolic  Name  Tables  ar.i  Ten-plates 

The  translator  builds  both  tne  symbolic  name  t-Ms 
and  the  linkage  table  template.  This  should  not  present  ary 
major  problem  for  the  translator  since  all  irferretien 
reauired  to  -onstruct  these  two  items  is,  in  general,  either 
easily  computable  or  found  in  the  translator's  symbol  table. 
Because  the  translator  builds  both,  it  is  not  necessary  for 
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entries  i  r.  c  i  t  r.  e  r  to  be  of  ’ '  n  i  f  o  r  m  size.  "  c.  e  t"arslator,  fcr 
•example,  'dews  t he  offset  (i.e.,  starting  location'  of  a 
symbolic  ran3  tahl°  entry.  Therefore,  whe r  tne  translator 
constructs  the  linkage  table  template,  each  ouW'oin*?  link 

car.  V  ini t  ia 1  iz^d  t c  pass  this  offset  to  tr.e  linker  (on 
first  reference  of  an  object.) 

Notice  that  a  one-tc-cn®  corr°spcnderce  exist 
between  entries  in  tne  linkage  table  body  and  the  symbolic 
nar10  table.  Thus,  if  tne  syrbelic  tare  table  is  constructed 
first,  tne  construction  of  the  ter pi  ate  becomes  trivial. 
A c t e r  the  header  of  the  template  is  built,  the  symbolic  name 
table  is  scanned  and  an  ou t* o i r.«c  or  incomirr  link  is 

initialized  within  the  tempiat**  (depending  or.  the  type  of 
svrbolic  name  encountered).  After  each  template  entry  is 
constructed,  the  offset  of  the  link  fro <-  the  start  of  the 

template  can  he  store'’  into  its  respective  entry  in 

oh je~ t . sym . 

? .  Sr.  try_'Jares  and  Snt  ry_Fo i r.  ts 

The  translator  should  b°  able  to  recognize  both 
entry  na’-es  and  tneir  associated  cntry  pcirts  ar.J  make 
appropriate  linkage  table  and  symbolic  care  table  entries 
accordingly,  '"he  inclusion  c f  e r.  t r y  pcirts  in  tne 

implementation  cf  a  dynamic  linker  is  hipvly  desirable, 
particularly  in  a  svst»m  with  a  limited  virtual  memory  size. 
I.o  this  environment  the  nurber  of  unlinking  evolutions  may 
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be  significantly  reduced  by  using  entry  points  to 

srall  data  or  procedure  objects  into  larger  ones 

12 

losing  the  smaller  object's  addressability 


comb ine 
w i thout 


This  process  is  kno*n  as  binding  in  Unities  [11]. 
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The  following  is  a  discussion  of  the  various  tables 
associatea  vitn  a  dynamic  linger.  The  formats  presented  uo 
net  rei-reser.  t  the  only  structures  possible,  however,  they 
contain  all  information  necessary  for  ay  .sarnie  iin'finp. 

1 .  The  Symbolic  hare  Table 

An  entry  in  tne  symbolic  name  table  (fi*v.re  = )  ir. 
addition  to  the  symbolic  name  includes  two  ether  iters.  The 
first  is  a  descriptor  consisting  of  a  type  tit  to  identify 
the  object  as  procedure  or  data;  an  identity  tit  tc  classify 
the  symbolic  name  as  an  external  reference  verses  entry 
name;  ar.d  a  size  field  to  pass  tc  the  1  i n .ter  the  nurb°r  of 
characters  in  the  symbolic  name. 


A  second  i  ten  to  be  included  is  tr.e  offset  of  a 
symbolic  name's  entry  in  tne  subroutine's  linkage  tcble.  lor 
external  references,  the  inclusion  cf  the  offset  in  a 
symbolic  name  tatle  entry  is  not  necessary*'  However,  its 
inclusion  does  remove  the  requirement  for  the  linker  to  save 
this  information  when  it  (the  linker)  is  invoked  by  ar. 
out^oine  link.  However,  for  an  access  'entry  point'1  into  an 
object,  the  offset  (of  the  incoming  link)  must  be  included 
in  the  symbolic  name  table  to  ensure  the  linker  knows  where, 
within  object. link,  to  construct  the  incoming  lirk.  Tr.e 
third  item  found  in  the  synbolic  name  table  is  the  entry 
point  (offset)  associated  with  each  entry  name  declared 
within  an  object,  (The  entry  point  is  used  to  construct  a 
virtual  address  of  the  form  <se<?men  t_ number !  er.tr.y_pc  in  t>. 
This  virtual  address  is  used  in  the  incoming;  link  to  invoke 
the  called  external  procedure.) 

It  may  be  desirable  to  separate  the  symbolic  name 
table  into  two  sections  consisting  of  external  references 
and  entry  points.  Assuming  the  entry  points  follow  the 
external  references,  a  pointer  to  the  be?innins“  of  the  entry 
points  should  be  stored  at  the  ter'inninr  of  the  table  to 
allow  the  dynamic  lirker  to  jump  directly  to  tne  entry  point 
section  when  required.  This  feature  would  permit  faster 
access  for  bo  tn  since  each  would  b®  stored  in  a  scalier  data 
structur®.  If  this  table  organization  is  used  it  would  not 
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be  necessary  to  include  an  identity  tit  in  the  descriptor  of 
an  entry.  (Note  that  the  syrr.boli'  name  table  is  searched  for 
entry  points  only,  since  external  references  are  accesses 
directly  via  the  outgoing-  link.) 

It  is  natural  to  ask  where  the  symbolic  rare  table 
of  an  object  is  located  within  a  process.  It  is  su<?eested 
that  'or  procedures,  tne  symbolic  name  table  be  appended  to 
the  end  of  a '  pro ceaure ' s  object  code.  This  will  require  only 
one  copy  of  the  symbol!1'  name  table  (which  represents  a  rure 
data  structure)  in  a  shared,  mul ti pro^rammi n?  environment. 
However,  for  external  data,  tne  symbolic  name  table  cannot 
be  located  at  the  end  of  the  data  since  it  will  limit  the 
ability  of  the  data  structure  to  srow  dynamically.  A  better 
solution  would  be  to  merge  the  data.sym  with  data. link  and 
store  the  two  in  the  combined  linkage  table.  This  format 
allows  the  data  to  be  based  at  offset  zero  and  stow 
dynamically.  (The  general  form  of  a  data  symbolic 
name/linkage  table  is  given  in  figure  9.) 

2 .  The  linkage  Table 

a.  The  Initialized  Linkage  Table 

The  initialized  linkage  table  is  shown  in  figure 
6.  The  header  of  the  linkage  table  contains  three  items.  Tne 
first  is  the  size  of  the  linkage  table.  This  item  tells  the 
lirk°r  tne  size  cf  the  template  wnen  building  an  object's 
linkage  table  and  also  is  used  by  tne  linker  to  adjust 
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linkage  tatle  addresses  wnen  removing  a  linkage  table  (dur¬ 
ing  unlinking).  (Recall  tnat  linked  list  threads,  linkage 
address  table  entries,  and  jumps  within  the  linkage  table 
must  be  adjusted  by  the  size  of  a  removed  linkage  table  dur¬ 
ing  compaction  of  the  combined  linkage  table.'  The  second 

\ 

and  third  items  found  in  the  header  consist  of  the  virtual 
address  of  the  symbolic  name  table  and  a  pointer  to  the  head 
(i.e.,  a  snapped  outgoing  link  to  the  object)  of  the  linked 
list  used  in  unlinking. 

Each  outgoing  link  in  the  body  of  the  linkage 
tatle  template  is  initialized  to  two  instructions.  The  first 
instruction  passes  the  entry's  offset  in  the  symbolic  name 
table  to  the  linker  (as  an  argument).  The  second  is  an 
instruction  which  results  in  tne  invocation  of  the  linker, 
logically,  the  twe  instructions  found  in  the  initialized 
outgoing  link  equate  to: 

CALL  Linker  (symholic_name_table_of? set ) 

The  designer  can  chose  from  three  basic 
mechanisms  that  may  be  used  to  invoke  the  linker.  First,  if 
the  translator  knows  the  virtual  address  of  the  linker  (such 
as  a  fixed  or  reserved  segment  number),  then  the  outgoing 
links  in  a  template  can  be  tailored  to  invoke  the  linker 
directly  (e.g.,  JTJM?  virtual  address  of  <Iir.ker>).  Tne 
second  method  is  to  invoke  the  linker  by  a  hardware  fault 
which  will  result  ir.  tne  linker  being  called  as  tne  fault 
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handler.  Tne  translator  would  therefore,  initialize  eacn 
outgoing1  link  to  push  the  offset  of  the  symbolic  name  on  tne 
machine  stack  and  then  induce  a  hardware  fault.  The  tr.ird 
mechanism  is  for  the  linker  to  enter  its  own  virtual  address 
in  each  outgoing  link  as  it  builds  a  procedure's  linkage 
table.  fThis  represents  the  least  desirable  technique  since 
it  requires  the  linker  to  know  the  format  of  the  body  of  a 
template  and  furthermore  is  much  slower  since  the  template 
must  be  scanned  as  the  linkage  table  is  built.) 
b.  Format  of  Snapped  Links 

A  format  for  snapped  out^oinc>  links  tc  external 
data  and  procedures  are  shown  in  figure  1Z .  The  snapped  out¬ 
going  link  for  a  procedure  consist  of  a  jump  to  the  incoming 
link  in  the  called  procedure's  link  for  an  external  proce¬ 
dure's  linkage  table.  The  snapped  incoming  link  leads  the 
linkage  pointer  with  the  virtual  address  of  the  called 
procedure's  linkage  table  (viz.,  Target .link) ,  and  then 
jumps  to  the  called  procedure  (as  defined  by  some  entry 
point).  For  external  data,  the  snapped  oute-oin*?  link  'ensist 
of  an  instruction  which  loads  a  register  with  tne  virtual 
address  of  the  data  followed  by  a  return  instruction. 
( Fecal 1  that  this  technique  is  used  when  tne  available  ha  rd- 
ware  does  not  support  an  indirect  addressing  approach.' 

’’’he  two  items  common  to  both  entries  (as  shown 
in  figure  1£),  'offset'  and  'linked  list  pointer',  represent 


information  to  be  used  during  unlinking.  The  offset  (of  tne 
symbolic  name  table  entry  corresponding  to  the  outroin? 
link)  is  used  when  resetting  the  entry  to  its  initialized 
form  while  the  linked  list  pointer  allows  the  unlinker  to 
find  each  entry  in  tne  combined  linkage  table  which 
references  the  object  being  removed, 
o .  The  linkage  Address  Table 

’’’o  facilitate  each  access  to  a  subroutine 's  linkage 
table  within  the  combined  linkage  table,  the  linkage  address 
table  is  used.  Entries  are  subscripted  by  segment  number  and 
contain  the  offset  of  an  object's  linkage  table  within  the 
combined  linkage  table.  (Mote  that  if  linkage  tables  were 
allocated  individual  segments,  vice  a  portion  of  the 
combined  linkage  table,  the  linkage  address  table  would 
contain  the  virtual  address  of  an  object's  linka.ee  table.) 

The  problem  arises  as  to  where  in  a  process  address 
space  the  linkage  address  table  should  be  located.  Cr.e  would 
like  to  avoid  allocating  the  linkage  address  table  its  own 
segment  and  pointer  register  since  these  resources  within  a 
microprocessor  are  usually  limited.  Assuring  the  linkage 
address  table  is  initialized  at  process  creation  and  is  a 
fixed  length,  a  possible  solution  is  to  place  it  at  the  head 
of  tne  combined  linkage  table.  If  this  approach  is  used,  the 
table's  base  address  would  be  the  segment  number  of  the 
linkage  table  (which  is  sto"ed  in  the  linkage  pointer)  with 
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an  offset  of  zero. 


F.  lM?IFivZNTATI  ON  OF  ENTF.Y_NAyES  ANT  ENTEY_?OINTS 

mo  avoid  confusion,  sore  of  the  fine  points  related  to 
the  implementat i on  of  entry  names  and  entry  points  will  te 
discussed  at  this  tire. 

First,  if  an  object  has  multiple  entry  points  declared 
withir.  it,  each  entry  point  must  have  a  unique  entry  in 
object. sym  and  a  unique  incoming  lint  in  object. link.  This 
is  logical  si  nee  each  entry  point  defines  a  distinct 
location  in  an  object.  Secondly,  if  a  procedure  contains 
external  references  to  several  entry  points  withir.  the  same 
object,  eacr.  unique  reference  must  have  its  owr  entry  witnir. 
the  procedure's  symbolic  name  table  and  its  own  outmoinf 
link.  rFor  example,  ^Tar^et !  Entry_Name_l'>  and 
<Tar;?et !  Entry_Name_2>  represent  distinct  ref e  rer.~es  . ) 

Notice  that  the  start  of  an  object  represents  an 
(frequently  implied!  entry  point  which  must  be  included  in 
the  object's  symbolic  name  table  and  nave  an  incoming  link. 
However,  one  would  like  not  to  explicitly  include  such  an 
'entry  point'  (e.*-.  <Tar<?et  j  Target> )  in  an  external 
reference.  Therefore,  it  is  summested  that  an  implementation 
default  to  this  implied  entry  point  in  the  absence  of  an 
entry  name. 
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V.  LINKING  WITHOUT  TRANSLATOR  SUPPORT 


In  *11  probability,  the  initial  implemer. taticr  of  a 
dynamic  linker  will  not  enjoy  the  translator  support  which 
has  oreviously  been  assured  to  exist.  Yet,  within  reasonable 
limitations,  one  would  like  to  te  able  to  utilizing  the 
features  of  dynamic  linking  in  an  unsupported1-^ 
environment.  Furthermore,  it  is  desirable  to  be  able  to  us e 
one  dynamic  linker  for  both  supported  and  unsupported 
procedures,  to  be  able  to  execute  both  supported  and 

unsupported  modules  within  a  process,  and  to  be  able  to  rail 
an  external  procedure  from  a  supported  module  without  having 
to  specifically  declare  the  procedure  to  be  supported  or 
unsupported.  (This  implies  the  linker  must  be  abl^  to 
differentiate  supported  procedures  from  unsupported  on°s.) 
An  implementation  is  proposed  wnicn  achieves  these  goal. 

A.  THE  INTERFACE  YCLULES 

In  an  unsupported  subroutine,  the  linker  should  be 
invoked  via  a  data  or  procedure  interface  module.  Two 
separate  modules  are  summested  since,  besides  the  fact  that 
their  functions  differ,  the  data  interface  module  must 
return  the  virtual  address  of  the  external  data  to  the  point 

For  the  purposes  of  this  thesis,  'supported'  will  be 
used  when  referring  to  an  environment  in  which  the 
translator  supports  dynamic  linking  while  'unsupported'  will 
be  used  to  reference  environments  which  lack  this  feature. 

62 


r  7 


of  call  while  the  procedure  interface  module  merely  execute? 

14 

the  snapped  link  .  Conceptually,  an  interface  module 
carries  out  those  functions  which,  in  a  supported  system, 
require  some  translator  support.  These  functions  include 
building  the  symbolic  name  table  and  the  linkage  table, 
invoking  the  linker,  and  executing  a  snapped  lin*c. 

1 .  linking  of  Procedures 

To  best  describe  the  functions  of  the  procedure 
interface  module ,  the  s  teps  to  dynamically  link  the 
unsupported  procedure  <Tar^et>  to  the  unsupported  procedure 
<Caller>  will  be  traced  (figure  11).  It  will  be  assured  that, 
the  procedure  interface  module  is  called  as  follows: 

CALI  IINK^FROCdar^et,  parameter^!*  pa  rameter_2 ,  .  .  .  ) 

The  first  function  that  dLIMK^FFCO  would  perform 
would  be  to  save  the  value  of  the  interface  linkage  pointer 
on  a  software  stack.  3e cause  a  translator  which  does  not 
support  dynamic  linking  would  not  know  that  the  linkage 
pointer  register  is  not  available  for  freneral  use.  in  all 
probability  object  code  produced  would  utilize  the  linkage 
pointer  register  requiring  an  interface  linkage  pointer  be 
established  and  saved  in  software.  (In  a  supported  system 


This  requires  that  two  interface  modules  (one  for 
procedures  and  one  for  data)  be  implemented  cue  to  the  fact 
that  most  hie  he r  level  languages  have  a  different  syntax  for 
procedures  which  return  arguments  to  the  point  cf  ''all 
(verses  those  which  do  not). 
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CALLER. SYM 


SEQUENCE  OF  EVENTS  FOR  LINKING  UNSUPPORTED  OBJECTS 


savin?  tne  linkage  pointer  register  is  acccmpl i sped  by  the 
translate!  cole.)  This  implies  that  there  are  two  linkage 
pointers  (viz.,  a  hardware  lin'd,-?®  pointer  and  an  interface 
linkage  pointer),  and  both  rust  be  initialized  to  point  to 
tne  beginning  of  tne  linkage  taole  for  <prcgram>  at  process 
initialization.  It  should  te  noted  that  tne  la*?  instruction 
o'  <LINK$FRCC>  rust  reset  the  interface  linkage  pointer  by 
p o p  1  n g  the  saved  value  off  a  software  stack  y  r  i  o  r  to 
returning  to  <Caller' . 

dl  INK$?E0C>  will  tnen  check  to  see  if  an  entry  for 
<Target>  exist  in  Caller. syr.  If  net,  <1  IMTiFROO  will  enter 
<Target>  in  Caller. syn  along  with  the  offset  of  tne  outgoing 
link  for  <Target>  In  Ca  1  ler .  1  ink .  <T L I M K  sFP.CO  is  able  to 
enter  this  offset  because  it  has  constructed  an  outgoing 
link  for  <Target>  in  tne  next  free  location  in  Caller. link. 
(The  outgoing  link  for  <Target>  is  of  the  sane  format  found 
ia  a  supported  svster  linkage  table.)  Tnis  outgoing  link  is 
executed  and  the  linker  is  invoked.  (Tn®se  st®ps  ensure  the 
use  of  tne  sane  linker  for  both  supported  arc  unsupported 
linking  since  the  rethod  of  linker  invocation  does  r.ot 
change . ) 

It  should  pointed  out  tha t  ha^  an  entry  for 
<Target>  already  existed,  then  <7a  rgetl  would  have  already 
beer  linked  to  <Caller>  and  <LIMn5??.0C>  would  only  hav®  to 
execute  the  snapped  link  (which  it  can  find  since  tne  offset 


65 


cf  toe  snapped  incoming  link  in  Ca  1  Ip  r .  1  i  r.k  is  save"4  ir. 
Caller. sym. 

Once  the  linker  is  called,  it  will  first  determine 
if  <Tare,et>  is  a  supported  or  unsupported  procedure.  The 
actual  mechanise  used  to  perform  this  meek  will  vary 
depending  on  the  operating  system.  One  rear.s  of  performing 
this  ch°ck  is  to  ta^  modules  within  the  file  system.  An 
alternative  vo”ld  te  to  tailor  the  first  tyte  of  c  supported 
module  to  identify  it  as  such.  (One  must  er.su r<=  wn°n  usiu^: 
tnis  method  that  an  unsupported  module  cannot  have  the  same 
hit  pattern  for  its  first  byte.)  In  this  thesis  it  will  be 
assumed  that  the  linker  can  query  the  file  system  to 
determine  whether  a  module  (external  object)  is  supported  cr 
not.  The  ability  of  the  linker  to  accomplish  this  check 
allows  an  external  reference  within  a  supported  subroutine 
to  have  the  same  format  regardless  of  whether  the  external 
object  referenced  is  supported  or  not.  This  prevents  having 
to  modify  and  retranslate  modules  when  an  unsupported  object 
is  retranslated  in  a  supported  environment. 

When  the  linker  determines  that  <?s.rget>N  is 
unsupported,  it  will  call  on  a  routine  in  ^II\K£FECC'  to 
allocate  a  section  of  the  combined  linkage  tabl°  to  be  used 
as  Tare-et.sym  and  Target .  link.  This  implies  that  t  h  ■=  next 
free  location  in  the  combined  linkage  tabl®  rust  be 
available  to  < 1 1 N K * PK 0 C >  in  addition  to  tne  linker  'for 


constructing  linka6e  tables  since  kLINKSPROC^  must  build  trie 
linkage  table  for  an  unsupported  subroutine.  Target. sym  ran 
be  located  within  Target. link  (vice  Target's  object  cede) 
since  the  linker  finds  Target. sym  via  a  pointer  in 
Target. link  (figure  12).  Additionally,  <IINKSPROC>  will 
construct  CTargeO's  linkage  address  table  entry  and  will 
initialize  the  header  of  Target. link. 

The  linker  needs  to  know  whether  ^Tar^e  t>  is 
supported  or  not  for  one  other  important  reason.  Erecutior. 
cf  <?a  r^e  t>  is  initiated  via  a  jump  from  <Cal  leri  .  li  r.k  to  an 
incoming  link  in  "’arget  .link.  incoming  link  normally 

consist  of  an  instruction  to  set  the  linkage  pointer 
r°^i  s  te  r  to  point  to  Tar  t’e  t.link  followed  by  a  jump  to 
<Tarpet>.  However,  if  <Tareet>  is  unsupported,  it  is  the 
interface  linkage  pointer  vice  the  linkage  pointer  register 
which  rust  be  set  requiring  the  linker  be  able  to 
distinguish  between  supported  and  unsupported  ext-r..al 
orocedures  and  snap  incoming  links  accordingly.  ..ius  ... e 
unsupported  incoming  link  will  be  of  the  forr: 


Interface  Linkage  pointer  =  Eas°  address  cf  Ta  rge  t .  1  i  n.c 
Jump  to  <Tar&et> 

Note  that  <TINZ*??OC'>  is  passed  net  only  tr.e 
symbolic  na^e  Target  ,  but  also  all  ot  <.Turcet  s 
parameters.  This  implies  that  CIIiNKiEROO  must  be  avle  tc 
pass  these  parameters  to  Clar^et')  in  accordance  w:tr.  trie 
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conventions  of  the  translator  which  compiled  <T  =  rget'>. 

2.  linking  of  Data 

The  sequence  of  events  to  link  (the  unsupported 
object)  <Iata>  to  <Caller>  would  be  quite  similar  to  tvose 
linking  <Ta rget>.  Assuming  <Ta  ta>  had  not  yet  be=n 
referenced  by  the  executing  process,  the  interface  module 
<IIMi"ATA>  would  build  an  outgoing  link  for  <Ta  ta)  in 
Caller. link  and  enter  the  symbolic  name  Lata  in  Caller. sym 

(as  <LIN^PR0C>  does  for  <Target>). 

The  linker  would  then  be  invoked  and,  upon 
determining  <Data>  to  be  unsupported,  would  "all 

<L I NE$EATA> .  <LINE$DATA>  would  construct  a  linkage  table  for 
<Data>;  however,  the  construction  of  Lata. link  woull  be 
trivial  since  it  consist  of  only  a  linked  list  pointer  ard  a 
linkage  table  size^entry  (figure  S'.  As  will  be  discusser, 
unsupported  objects  cannot  have  multiple  entry  points.* 
therefore.  Oata>  does  not  require  a  symbolic  name  table. 
Following  the  construction  of  Data. link,  the  linker  would 
snap  the  link  between  <Iata>  and  <’CallerN.  The  snapped  xir..< 
in  this  situation  would  differ  somewhat  in  that  or.ce 
snapped,  the  link  will  be  used  tv  <LINF$DATA'>  to  obtain  the 
virtual  address  of  <Data>.  <IINK$BATA>  completes  the 
reference  to  <Data>  by  returning  <'Data>'s  virtual  address  to 

(the  Doint  of  call)  in  <Caller>  . 


9 


E.  i  ly  I- a'*:? \IS  Of  *JN  SUPPORTED  IINTIVG 

There  are  four  major  iisadvan  tages  when  linking  in  an 
unsupported  environment.  Three  of  these  represent  violations 
of  design  criteria  (as  specified  in  Table  1)  while  the 
fourth,  the  inability  to  implement  multiple  entry  points,  is 
considered  a  limitation  in  the  flexibility  associated  with  a 
dynamic  linking  environment. 

The  first  disadvantage  is  that  unsupcorted  linking 
results  in  excessive  overhead  for  subsequent  references  to 
an  external  object,  as  required  by  tne  limited  overhead 
criterion.  This  is  a  direct  result  of  the  fact  that  tne 
interface  module  must  be  invoked  for  earn  external  reference 
to  perform  those  bookkeeping  functions  (such  as  manipulating 
the  interface  linkage  pointer)  which  in  a  supported 
environment  are  performed  by  the  translated  external 
reference  and  the  snapped  link. 

A  second  disadvantage  is  that  an  external  procedure  must 
be  linked  before  it  car.  be  passed  to  a  subroutine  as  a 
parameter.  This  contradicts  the  delayed  binding  criterion. 
Furthermore,  to  pass  an  external  procedure  as  a  parameter 
reauires  a  third  interface  module.  A  third  interface  module 
is  called  for  since  <LINK$F?.0C>  can  only  link  and  invoke 
external  procedures  whereas  to  pass  a  procedure  as  a 
parameter,  it  is  necessary  to  have  access  to  tv' e  procedure's 
virtual  address.  (In  the  case  of  an  external  procedure,  it 
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is  sufficient  to  pass  the  virtual  address  of  tue  procedure's 
outgoing  link:  vice  the  virtual  address  of  the  external 
procedure  itself.)  Therefore,  the  third  interface  nodule 
will  snap  the  link  (in  violation  of  the  delayed  binding 
criterion)  and  return  the  virtual  address  of  the  external 
procedure 's  outgoing  link  to  the  point  of  call. 

The  third  disadvantage  involves  a  violation  of  the 
syntactic  compatibility  criterion  for  external  data.  Note 
that  the  utilization  of  external  data  is  limited  to  a  (FL/I 
or  PI/y)  based  variable  structure  since  <1 1 MTT  $  HAT  ®>  can  crly 
return  the  virtual  address  (of  the  external  data'  to  the 
point  of  call. 

The  final  disadvantage  is  that  multiple  entry  points 
cannot  be  implemented  in  an  unsupported  object,  finre  tne 
(translator  constructed)  symbolic  name  table  is  necessary  to 
retain  the  entry  name  and  entry  point  associated  with  ar 
access  into  an  object,  an  unsupported  object  can  only  be 
referenced  at  its  conve  nt  i  or.a  1  starting  location. 
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V I .  HARI'kAPE  TO  ENHANCE  L Y Nt A Y I C  LUTING 

Even  though  care  has  been  taken  tc  ievelc^  a  dynamic 
linker  which  is  not  dependent  on  the  availability  of  certain 
hardware  features,  there  are  hardware  capabilities  whi-'h  are 
desirable  in  a  dynamic  linking  environment.  In  general, 
these  features  can  be  divided  into  two  general  categories: 
those  which  effect  the  design  of  the  linker;  and  these  which 
impact  or.  system  performance. 

It  is  emphasized  that  the  following  discussicr  is 
presented  with  the  idea  that,  if  one  is  iroir.g  to  include 
dynamic  linking  in  a  system  and  has  a  choice  cf  processors, 
one  should  look  for  certain  hardware  features  which  are 
desirable  in  a  dynamic  linking  environment.  This  section 
should  not  be  viewed  as  a  list  of  hardware  support  necessary 
for  the  feasible  implementation  cf  a  dynamic  linker. 

A.  HA? E WAP L  FEATUF Fc  AFFECTING  I INKER  RESIGN 

All  the  hardware  features  discussed  in  this  section 
dictate  in  some  manner  how  certain  functions  of  a  dynarU 
linker  must  be  implemented.  However,  the  first  two  features 
discussed  (viz.,  indirect  addressing  and  a  hardware  fault  on 
indirection)  are  necessary  to  allow  a  linker  to  fully  r=et 
the  design  criteria  of  Table  1. 
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Hardware  Indirection  and  Faults  cn  Indirection 


For  the  rost  part,  it  has  teen  assuned  the  linker 
was  invoked  (or.  tne  first  reference  of  an  object )  via  the 
initialized  code  of  the  outgoing  link.  However,  the  most 
desirable  rrethod  of  linker  invocation  requires  t*e  processor 
to  provide  two  hardware  features:  Cl)  The  ability  to 
reference  data  and  call  procedures  using  indirect  addressing 
through  memory?  and  (2)  the  ability  to  generate  a  hardware 
fault  during  indirection. 

When  a  hardware  fault  on  indirection  is  available, 
references  to  external  objects  are  achieved  via  indirect 
addressing  instructions  where  the  final  'target  address'  (in 
the  indirection  sequence)  is  stored  in  tne  outgoing  link  of 
the  executing  procedure's  linkage  table.  The  outgoing  link 
is  initialized  to  cause  a  fault  (on  indirection)  which 
results  in  the  invocation  of  the  linker  as  the  fault 
nandler.  The  linker  snaps  the  outgoing  link  by  altering  the 
initialized  fault -i nducing  code  to  either  the  virtual 
address  of  the  incoming  link  (for  external  procedure  calls) 
or  the  virtual  address  of  the  external  data.  (This 
represents  the  method  used  in  Multics  [11].) 

Without  a  fault  on  indirection,  it  is  net  apparent 
how  to  pass  external  data  as  a  parameter  wit.oout  first 
snapping-  the  link  to  the  data.  This  represents  a  violation 
of  the  delayed  binding  criterion  (of  Table  1)  because  the 
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binding  of  a  symbolic  name  to  a  virtual  address  has  been 
performed  prior  to  first  reference.  (Note  that  ever,  thougn 
tne  external  data  is  passed  as  a  parameter,  it  may  not 
necessarily  be  referenced  within  the  procedure.1^  ) 

2.  Other  ^features  Influencing  Linger  Implementation 

There  are  certain  hardware  features  which  do  not 
restrict  the  implementation  of  a  dynamic  linger,  .  tut  no 
effect  certain  aspects  of  the  linker  design.  Two  hardware 
features  which  are  considered  advantageous  in  a  dynamic 
linking  environment  will  be  discussed. 

The  first  feature  relates  to  the  number  of  segments 
available  in  a  process  address  space,  tfcre  specifically.  if 
there  are  adequate  segments  (and  each  segment  is  cf 
reasonable  size),  then  it  may  not  be  necessary  to  frequently 
execute  tne  unlinking  portion  of  a  dynamic  linker.  (Note 
that  unlinking  is  still  necessary  because  segments  deleted 
from  an  address  space  shcul'’  be  unlinked.)  This  is 
considered  advantageous  since  unlinking  is  considered  one  of 
the  more  expensive  functions  to  execute.  Note  that  if 
unlinking  is  not  implemented,  segments  can  always  be 
conserved  by  combining  smaller  objects  into  a  single  segr er. t 
and  referencing  each  object  via  an  entry  point. 

^  One  is  free  to  judge  how  much  of  a  limitation  the 
absence  cf  these  two  hardware  features  presents,  new-over, 
the  author  does  not  consider  it  very  prohibitive. 
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The  object  cole  produced  by  the  translator  is 
subject  to  the  hardware  features  available.  Ir  a  dynamic 
linking  environment,  some  hardware  features  tend  to  simplify 
the  object  code  produced  for  an  external  reference.  Fcr 
example,  if  hardware  registers  are  automatically  saved  by 
the  procedure  CALL  and  RETURN  conventions,  then  it  is  net 
necessary  for  the  object  code  (during  an  external  prooe^ure 
call)  to  explicitly  save  and  reset  the  linkage  pointer, 
possess  an  indirect  addressing  C  A 1 1  instruction  but  car.  cr.ly 
perform 

E.  EAPEWAF.  i  FEATURES  AFFECTING  SYSTEM  PEEF CP.MAXCF 

There  exist  hardware  capabilities  whi cn  pnnarce  system 
performance  in  a  dynamic  linking  environment.  These 
capabilities  do  not  directly  effect  the  design  cf  the 
linker?  but,  because  of  the  requirements  dvr.amic  1  i n  k i Uf 
places  cn  the  operating  system  (such  as  dynamic 
relocatability  of  code),  the  inclusion  of  certain  hardware 
features  serves  to  improve  overall  system  perf orma r.ce . 

In  a  dynamic  linking  environment,  subroutines  are  not 
bound  to  virtual  addresses  (in  a  process  address  space) 
until  run  time.  Therefore,  they  rust  reside  or.  secondary 
storage  in  a  relocatable  form  and  be  dynamically  relocated 
during  process  execution.  Thus,  tee  more  efficiently  cc-ce 
can  be  relocated,  the  better  system  performance  (viz., 
execution  speed)  will  be.  This  implies  that  hardware 
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relocatability  of  code  is  desirable. 

A  second  hardware  capability  which  enhances  system 
performance  is  hardware  segmen ta t ion .  Even  though  tne  linker 
design  is  not  dependent  on  tne  support  of  segmentation 
hardware,  many  of  the  attributes  associated  with  procedure 
and  data  objects  (which  are  logical  entities,  or  segments), 
are  in  fact  intrinsic  to  segmentation.  These  attributes 
include  object  (unique)  identifiers  (viz.,  segm»r.t  numbers) 
and  object  virtual  addresses  (viz.,  an  object  segment  rvrber 
+  offset).  It  is  therefore  reasonable  to  corclude  that 
segmentation  hardware  is  desirable  (but  not  essential)  in  a 
dynamic  linking  environment. 


VII .  A  DEMONSTRATION  TF  DYNAMIC  LINKING 

In  order  to  support  the  design  concepts  of  tnis  thesis, 
and,  in  a  sense,  prove  the  feasibility  of  ri Tocon-put er 
dynamic  linking,  a  subset  of  the  dynamic  linker  Resign  (not 
including  unlinking)  was  implemented  on  an  Intel  £C££  based 
system.  The  8?8£?  microprocessor  [13]  was  selected  b«=causc  of 
its  lack  of  hardware  support,  a  fact  which  supported  the 
contentior  that  the  linker  design  is  hardware  independent. 

The  implementation  consisted  of  five  modules:  1 1  i  a 
process  initialization  module,  (2)  the  dynamic  lirker 
module,  (3)  the  address  space  manager,  '4)  a  display  linkage 
table  routine,  and  (5)  a  package  cf  system  library  routines. 
Three  of  these  modules  (process  initialization,  the  dynamic 
linker,  and  the  address  space  manager)  will  be  discussed  in 
detail.  The  display  linkage  table  routine  was  ircluded  ir 
the  implementation  strictly  to  add  clarity  to  tne 
d°morstraticn  and  will  ret  be  discussed  in  detail.  (Source 
listings  for  the  display  linkage  table  routine  and  tr.e 
system  library  routines  are  provided  in  appendix  (£)  for  the 
interested  reader.) 

The  implementation  of  the  dyramic  linker  ran  on  t.hc  CF/h 
operating  system  [21].  The  hardware  support  included  two 
eieht  inch  floppy  disk  drives  and  55K  of  main  memory. 
Modules  were  written  in  FI/1*-??  [2?]  and  compiled  under  the 
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Isis-II  operating  system  [19]. 

(It  should  be  noted  at  this  tine  that  because  r.o 
translator  which  supported  dynamic  linking  was  available, 
test  programs  were  hand  compiled  to  produce  the  necessary 
object  code,  symbolic  name  tables,  and  linkage  table 
templates . ) 

a.  the  -donums  cj  tfe  dynamic  links* 

Thp  three  major  modules  of  the  linger  v  =  re  th°  trrrcss 
initialization  module,  the  ^dynamic}  linker  module,  and  the 
address  spacc  manager .  Ir i°f ly ,  these  modules  perform  tr.c 
following  functions*. 

?rccc$s  Initialization  is  passed  the  argument  'prc-*ram 
name'  and  performs  the  following: 

1.  Extracts  the  rare  of  the  program  to  be  executed  <’rcm 
the  comnand  line. 

0 .  Causes  the  linker  module  and  tne  address  snare 
manager  to  be  initialized. 

U.  Causes  the  address  space  manager  to  ( l '  enter  tne 
program  in  th°  process  address  space  and  !  2)  load  the 
program  into  memory. 

4.  Causes  the  lirker  module  tc  build  a  lirkage  tabl°  for 
the  program. 

f.  Builds  the  interrupt  handler.  Thc  interrupt  .tardier 
is  invoked  by  initialized  outgoing  links  and,  in  turn, 
invokes  the  linker  •"oduie. 

6.  Starts  the  program  in  execution. 
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” .  If  th°  i I  splay  toggle  was  set  ■  i 'a  the  command 
causes  th.c  process  reference  table  and  cerrbire-4 
table  to  be  displayed  follow  in.?  corpletion  cf 
execution. 


lire). 
1  ir.kage 
o  r  o  r  r  a  m 


The  linger  module  is 
with  the  arguments  'linkage 
offset'  (in  tee  symbolic 
f ollowirr: 


i evoked  (vy  the  fault  handler) 
pointer'  and  'symbolic  ramc 
name  table'  and  perforrs  the 


1.  Extracts  the  character  strir.r  naoe  a^sociatec  with 
the  external  reference  from  the  calling  oro -"edur? 's 
symbolic  name  table. 

2.  Ir voke  s  the  address  space  manager  passing  a?  ar. 
argument  the  symbolic  name  of  the  external  object  (to  be 
linked ) . 

3.  Builds  a  linkage  table  for  the  external  object  (if 
necessary ) . 

4.  Extracts  the  data  associated  with  the  entry  rare 
field  f cf  t.ne  external  reference)  frcr  tr.e  external 
object's  symbolic  name  table. 

5.  Snaps  the  outgoing  and  (if  required)  incoming  links. 

5.  Causes  the  snapped  outgoing  link  to  be  executed  by 
returning-  the  address  of  the  outgoing-  link  to  the 
interrupt  handler.  The  interrupt  handler  then  jumps  to 
♦he  outgoing  link. 


The  Address  Space  t'anager  consists  of  two  submodules. 
AST^MakeSAcces sable  is  invoked  with  the  argument  'symvclic 
name'  (of  an  object)  and  performs  the  following: 


1.  Eeterm ir.es  if  the  object  is  already  in  the  process 
address  space. 
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° .  If  rot,  load?  the  object  into  memory  <  pprf  or"ir..-;  a 
relocation  if  th®  object  is  executable  code)  a  r.  d  ma  ices 
an  entry  for  the  object  in  the  process  reference  table. 

3.  Returns  to  the  point  of  call  the  unique  identifier 
and  base  address  (viz.,  'virtual  address'!  of  the 
object. 

A'f i,DmcveiS?2  is  invoiced,  with  the  ®  retime  rt  'symbolic 

n are'  and  performs  the  following:  .1  .  Feyoves  an  object 
from  a  process  address  space  by  deleting?  tne  cvj®ct's 
entry  in  the  process  reference  tabl®. 

The  implementation  of  each,  of  tr.es®  mod  u le c  will  now  be 
reviewed  in  detail.  The  discussion  will  include 
implementation  details  dictated  bv  trc  r.ardwar®  ar.d 
C?/!*  operating  system  support  utilized. 

1 .  Pro^pss  Initialization 

The  linger  implement  a  t  i  or.  was  call  '“rec'  and  was 
invoiced  by  the  CF/Y  command  lin® 

A'Ftec  pro pt am_narre  *<cr> 

The  first  function  of  process  initialization  was  tc  scar  *r.° 
command  line  to  determine  the  r.am°  of  the  oro^-rar  'viz., 
pro^1*an_name )  to  be  executed.  This  was  performed  by  tr.c 
P.FAItCOt'yANr^LINE  subroutine  which  read  th®  CTO'  buffer  to 
extract  the  program  name.  A  d  d  i  t  i  oral  ly .  if  tr.c  last 
character  of  the  command  line  was  (wnich  is  optional), 
the  display  tommie  was  set  teliir.e  orccess  initialization  to 
displav  the  process  referen ca  table  an"  corbir®d  iir'ra.:? 
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follow  ir.rt  tue  r  cf.  le  t  i r  r  o  . 


ijrcrT -  r  e  7 e ""  t  i  0 . 


Ad  d  i  t  ior.a  1  ly  ,  c  i  r.  c  ^ 


p  r  o  c',  r  a  it 


execut-Pi' 


RSAT  ^COvvA\r^LIM  assumes  for  tne  t)rc*T=^  £  C?  /!'  filetvpe  of 

'  ~  ^  v* '  16 

Process  initialization  trier.  calls  on  tne  sutrorTir=s 
INITIAII^niAf^  and  I M  T I A  L 1 3  ?  A 1 1 N  K I  ?.  wrier.  iriUeliz-=  t-c 
address  space  manager  ar.d  li’->er  •noinle<;  re s  pe  o t  i '••'£  1  y . 
(These  two  subroutines  are  a  part  of  their  respective 
nodules  and  will  be  discussed  in  detail  with  the  r  =  T,er.t 


-odu 1® .  1 


Having  initialized  the  address  soero  n  a  rarer  a  r. 


linker  nodule, 


^ess  initialization  th® n  ®rt®rs  t  h® 


prorran  ir.  the  process  address  space  ar.d  builds  it  a  linkage 
table.  Th®  program  is  entered  in  tb®  address  spa~e  by 
eaiiinp-  or  AS  v  a  y  ART  a  *  CC  PS  S  ABLE  (cassias*  prc’rar  nam®  as  an 
arnnent  .  1  A?  *'•  it'AKPi  ACCF?  S  ABI  7  returns  to  process 

initialization  the  unique  identifier  and  base  address 
assigned  to  the  prorran.  fit  should  le  noted  that  teceuse 
the  SPEC  does  rot  provide  Hardware  se.?r®  n  t  a.  t  i  c  r  ,  it  was 
necessary  to  utilize  a  unique  identifier  and  base  aicress  ir. 


C?/b  utilizes  a  filetvoe  field  t*  distireuisn  tne 
various  tyn°s  of  fil°s  (or.  ',is>  st^raz0'  .  Tne  fil®typ®s 
utilized  ^  y  the  linker  implementation  w®re  fll  Cuv  -  a  file 
of  executable  code:  ' ?)  TTA  -  an  data  file:  (3'  3VP  -  a 
linkage  table  template  file;  and  (4;  PIP  -  a  file  of 
relocation  bits  for  a  CCV  file. 
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place  of  tne  object  segment  number.  1  Process  initialization 
then  calls  on  an  entry  point  into  the  linker  rcitle  •’viz., 
th®  subroutine  1 1 N?)  4G E  $T  43  L Z  S n  ''TJT I N  FS  )  woim  builds  a 
lin^3=»9  table  for  the  program. 

"he  next  function  of  orocess  initialization  is  tc 
build  the  interrupt  vector.  It  was  decided  to  invoke  the 
linker  (vuen  snapping  a  .link)  via  a  software  fault.  Tr.is 
technique  allowed  initialized  oumoin  p  links  tc  be 
independent  of  the  linker  address  by  navi  nr  the  outgoing; 
link  jump  (via.  a  software  fault  1  to  a  predetermined  lo~atior 
wn i oh  then  invoked  the  linker.  (The  software  fault  used  was 
an  ?f?f  F5T  4  instruction  which  saves  tne  current  execution 
point  on  the  stark  and  jupps  to  th®  irterruot  vector  at 
location 

interrupt  vector  first  removes  the  return 
address  placed  on  the  stack  by  tne  FST  4  instruction.  This 
address  represents  the  address  at  The  end  of  to®  c u t c i r. r 
link?  when  me  link  is  snapped,  it  is  desired  to  jump  to 
the  beginning  of  th®  cut.-cim  link  (to  ref  erenc®  tne 
external  object).  Tne  next  instruction  of  the  interrupt 
vector  calls  the  linker  module  passim?  to  it  the  linkage 
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and  i  rc^istpr  pair  by  the  initialized  put^c:  r/  lirv.)  '"her 
the  1  i nicer  module  has  completed  execution  it  return s  t ne 
address  o*  the  outgo-; nm  link  to  tee  irtcrrupt  venter  (in  the 
hardware  H  ar.d  L  register  pair).  The  interrupt  vector  then 
pens  any  arguments  initially  passed  (by  the  caller!  tc  the 
external  object  into  the  E  and  E  register  pair  and  jumps  tc 
the  outgoing1  link-  (The  E  and  I  register  pair  is  used  to 
pass  arguments  or  pointers  to  a  list  of  areumerts  between 
°i terna 1  objects.) 

Finally,  process  initialization  loads  the  initial 
value  of  the  linkage  pointer  into  the  F  and  C  register  pair 
and  invokes  the  program  to  be  executed.  These  tve  functions 
are  performed  by  the  subroutine  FTFCJTF. 

2 ,  The  linker  Module 

Th.e  linker  module  was  initially  written  in  a  h  imh 
level  pseudocode  (appendi-T  (A')  and  tnen  tra.nclated  into 
FI/V'-PE.  This  permitted  an  orderly  approa-h  to  the 
implementation  of  the  dynamic  linker  module.  The  linker 
module  consisted  of  five  major  subroutines  and  a  -or.trcl 
routine.  The  logical  relation  between  linker  subroutines  is 
mi ven  in  fimur°  12. 

As  has  teen  noted  tne  linker  module  (i.e.,  the 
control  routine  IINKFF^  was  invoked  tv  the  interrupt  vector. 
LINKF?  first  -alls  on  ACCESS  $S  Y^irCL  IC  An  amFAFaTA  passir.m  as 
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CUT  IKES  C7  TVE  UtiUf  KuTULF 


arguments  the  linkage  pointer  and  tne  symbolic  name  offset. 
ACCESS  5S  YMEOI IC  ANAMEABATA  utilizes  the  linkage  pointer  to 
address  the  linkage  table  of  the  calling  pro^edurp  (which 
will  he  referred  to  as  <Caller>^  and  extracts  the  address  of 
Caller. syrr.  The  entry  of  the  external  reference  (viz., 
<Target  !  Sr.  trv_£l>)  is  then  computed  hv  adding  the  syrbolic 
name  offset  to  the  address  of  Caller. sym. 

AC CES SA S YMEOI. I C$ NAMFSDATA.  can  now  extract  (  f ror  the 
symbolic  name  table)  the  symbol!''  name  "Target",  the  entry 
name  ”Ertry_#l",  the  offset  of  <Ta r^et ! 2n t ry_^l> 's  outgoing 
link  (in  Call er . link ) ,  and  <Tareet>'s  type  (i.e.,  procedure 
or  data).  ACCESS ^STNPOI IC^NAMEABA TA  will  compute  the  address 
of  <Target J En try_£l>'s  outgoing  link  (by  adding  the  outgoing 
link  offset  to  the  base  of  Caller. link) .  Next  it  will  set 
the  C?/v  filetvpe  for  <Target>  (viz.,  'COM'  for  procedures 
and  'ETA'  for  datal  in  the  symbolic  name  buffer.  'The 
symbolic  name  buffer  stores  the  filename  and  filetvpe  of  tne 
object  being  linked  in  a  standardized  format.  The 
standardized  format  is  of  tne  form  '7IIZNAVE  .FIlZmv?'p ' .  Thus 
if  ^Target''  was  a  procedure,  the  symbolic  name  buffer  would 
contain  tne  entry  'TAHC-ST.COM'.) 

TINKER  can  now  call  on  the  address  space  mareger 
( ASMA^AK^ACCZSS AEIE)  to  learn  the  segment  number  (i.e..  the 
unique  identifier  and  base  aadress)  of  ^TargeO.  Cnee  I  INK  IF 
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knows  kTar^e  t,> '  s  segment  number  date,  if  will  invoke  the 
subroutine  1 1  MKAG£$TAEIE$F,CTJTI  NTS  . 

LINEAGE£TAILE£ROlTTINES  determines  if  a  linkage  table 
already  exist  for  <'T'arp-et>  by  checkins  the  valid  entry  bit 
of  the  linkage  address  table  entry. for  <Tarset>.  (Recall 
that  the  unique  identifier  of  an  object  is  used  as  a 
subscript  into  the  linkage  address  table  to  access  tne  base 
address  of  the  object's  linkage  table.)  If  Target. link  Joes 
net  exist,  IINKAGE$TABLF$R0U?IN7S  will  invoke 
BTTILIS0E JSC""$IIMK  to  construct  a  linkage  table  for  'Tar^eth 
and  will  update  <Tarf?et>'s  entry  in  the  linkage  address 
table.  Otherwise,  LIMEAGEaTABLS ^ROUTINES  merely  returns  a 
pointer  (the  parameter  'JEVSIINSSFTF. )  to  point  to 
Target. link. 

rUIIT^OR JECT^LI'JK  first  causes  the  aadress  space 
manager  to  enter  kTargeO 's  linkage  table  template  in  tne 
process  address  space.  It  does  this  by  appending  to  tr.e 
program  name  (<Tarset>)  the  CR/M  filetype  cf  'TVP'.  (7cr 
example,  if  <Tarf.et>  were  a  procedure,  the  executable  cone 
would  exist  in  the  file  TARGET. CO*  wnile  <7arsetv's  temDlate 
is  in  the  file  TABGET .TMR . )  Once  the  template  is  leaded  intc 
memory,  PUILTSOI JECT$II NK  fir^t  computes  the  address  of 
Ta  rse  t .  sym . 
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Recall  tnat  for  a  procedure,  the  symboli-  rare  table 
is  appended  to  the  end  of  the  object  code.  Thu?,  the  address 
of  the  symbolic  name  table  for  procedures  is  -orputea  by 
addin?  the  offset  of  the  symbolic  name  table  (found  in  tr.e 
template)  to  the  base  address  of  the  object.  For  data,  the 
symbolic  name  table  is  a  part  of  the  linkage  table  and  its 
address  is  computed  bv  addin?  the  symbolic  name  table  offset 
to  the  data  object's  linkage  table  base  address. 

3TTIIDS0BJECT$LINK  then  enters  tne  (computed) 
symbolic  name  table  address,  the  linkage  table  size,  ar.d  the 
body  of  the  linkage  table  in  the  combined  linkage  table  as 
Target .  1  ink.  (The  conbined  linkage  table  was  a  statically 
allocated  IK  block  of  memory.)  BUIIE^CE  JSCT4LINS  tner. 
removes  the  template  from  the  pro-ess  address  space 
by  invoking  (an  address  spare  manager 
routine  1  ?  ) , 

Now  that  Target. link  exist,  the  linker  module  can 
find  Target. sym  (via  a  pointer  in  Target . link 's  header)  ar.d 


17 

The  decision  to  build  linkage  tables  in  this  manner  was 
driven  by  an  effort  to  simulate  the  mechanisms  which  would 
occur  if  hardware  s°gmentat ion  were  available.  To  create 
Target.link  in  a  segmented  system,  it  would  be  necessary  to 
make  a  copy  of  the  (pure  and  sharable)  template.  However,  in 
this  impi ementa t i on ,  since  tne  disk  copy  of  a  template 
remains  pure,  the  process  copy  (as  introduced .  by  the  address 
space  manager)  could  have  just  as  easily  served  as  the 
linkage  table  without  recopving  it  into  tne  combined  linkage 
table.  (Note  that  this  approach  would  eliminate  the  need  for 
a  statically  allocated  combined  linkage  table.) 
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access  the  data  associated  witn  entry  name.  Thc  routine 
ACCrS3$ENT?.Y^NAN'S$EATA  does  this  by  searching  Target. s/r. 
with  the  argument  '2ntry_#l Recall  that  the  symbolic  narre 
table  entry  for  an  entry  name  includes  the  incoming  link 
offset  and  the  entry  point  (of  Tr.try_*l  into  <Target>).  'Fhus 
by  add  inf  the  incoming  link  offset  to  the  ba  s<=  addrpss  of 
Tareet.link,  the  incoming  link  address  can  be  computed. 
Additionally,  the  entry  point  (offset)  plus  thc  base  address 
of  <Tar?ret>  is  the  target  address  referenced  by  tr.e  symbolic 
nare  "Ta  r4’°t !  Tn  try_*l  "  . 

All  the  information  necessary  to  snap  the  link  is 
now  available  and  I  INKER  calls  on  tnc  subroutine 
SMAP$TFE$IINKS  to  perform  this  function.  The  final 
subroutine  of  the  linker  module  is  INITIAI IZESLINKER  which 
is  invokes  by  process  initialization.  INITIA1I7EUINSE? 
initializes  various  pointers  (used  by  the  linker  module)  and 
the  valid  entry  bits  of  the  linkage  address  table.  It 
returns  to  process  initialization  the  address  of  I  INKS?,  (for 
use  in  the  interrupt  vector),  the  adiress  of  the  linkage 
address  table  (which  is  passed  as  a  oarameter  to  the  display 
linkage  table  routine),  and  the  base  address  of  the  combined 
linkage  table  (which  is  used  in  EXECUTE  to  initialize  tne 
linkage  pointer) . 
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recause  the  CP/V  operating  system  lacked  any  memory 
management  executive,  it  was  necessary  for  the  address  space 
manager  to  perform  functions  wnich  would  usually  be  provided 
by  the  operating  system.  Thus  the  address  spare  ra name r  had 
to  he  abl<=  to  load  objects  into  free  memory  and  relocate 
executable  code.  These  functions  were  carried  cut  by  the 
subroutines  LCAL$0rJ2CT  and  RELOCATE  respectively.  The 
implementation  of  the  two  subroutines  was  extremely 
primitive  providing  only  tne  minimum  suprort  recessary  to 
allow  the  implementation  cf  the  remainder  of  tne  address 
space  manager  (and  will  not  be  discussed  in  any  further 
detail ) . 

like  the  dynamic  linker  module,  the  address  space 
manager  was  first  written  in  pseudocode  (app°ndix  (A.)  and 
translated  into  PI/m-60.  It  centers  around  the  managing  of 
the  process  reference  table  which  is  implemented,  as  an  array 
of  structures  of  the  form: 

Pro  cess  _?.ef  e  rence_Table  :  ARRAY  of  STRUCTURES  of 

Valid_bit  :  BOOLEAN; 

Name  :  ARRAY  of  CHARACTERS ; 
Ease  address  :  ACTRESS; 

end; 

The  valid_bit  field  was  set  to  'valid'  if  the  entry 
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represented  an  otject  in  tne  process  address  space.  Tne  rare 
field  contained  the  object  name  ir.  standardized  fcrm  'e.m.. 
CAILE® ,C0M )  while  the  tase_address  is  thp  location  (in 
memory)  where  the  object  was  loaded.  Niote  also  that  an 
object's  unique  identifier  represents  an  implied  process 
reference  table  field  and  corresponded  to  the  subscript  of 
the  object's  entry  in  the  process  reference  table. 

When  AS^$MAES$ACCE?SA5IE  is  invoiced,  it  is  passed 
the  object  name  (in  standard  form)  as  an  argument. 
ASMtlulAEEAACCESS ABLE  first  search°s  tne  process  reference 
table  to  determine  if  <Tarmet>  already  has  an  entry 
(implying  ^TarmeO  is  already  ir  the  orccess  address  spa'-e). 
If  not,  ICAr$CSJECT  is  invoked  to  load  CTar^e tb  into  memory 
returning  the  base  address  of  <^Tarmet>  to  the  point  of  call. 
ASM$ viEES ACCESSASLE  tner.  enters  <Target>  in  the  process 
reference  table  in  the  first  free  entry.  The  final  function 
ASb"$MAKE$ ACCESSA3I E  is  to  return  the  base  address  and 
unique  identifier  (viz.,  the  process  reference  table 
subscript)  o*  <Target>  to  the  point  of  call. 

The  subroutine  ASMiREi^OVESSEG  is  passed  ar  object 
ramp  (in  standard  form)  and  deiptes  the  object  from  the 
process  address  space  by  setting  the  object's  valid_bit  in 
the  process  reference  table  to  'invalid'. 
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't'wo  other  subroutines  i r. eluded  in  the  address  s^a-e 
irenae-er  were  DISPIA^PPT  which  displayed  the  prcress 
reference  table  (and  is  not  necessary  ir.  a  dynamic  linker 
implementation)  and  I MTI  ALIZZ^AS.d.  IMTI  All  Z2*  AS"  is 
invoked  by  process  initialization  fas  is  EISFI A  f$-P.  ;  ar.n 
initializes  the  valid_bits  of  the  process  reference  table  to 
'invalid'.  Additionally  it  statically  SPte  the  sire  of  the 
process  reference  table  'which  was  arbitrarily  set  tc  IP 
entries)  and  initializes  a  free  memory  pointer  for  the 
LOAE$CEJECT  subroutine. 
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3.  TFP  TEST  PP.OGr.fi vs 

Two  test  programs  were  ruu  on  the  dynamic  linger.  The 
first,  ES>0,  compute^  ar.d  aisplaved  (in  nexadecima  1  fern1! 
the  multiplication  and  addition  tables  (with  appropriate 
headers  for  the  rubbers  from,  7  to  IS.  Th°  second  test 
program,  5TJiv,  added  trie  element6  of  an  external  data  array 
a  n^  displayed  tne  result  in  hexadeciral  form.  T  TFO 
demonstrated  all  tne  capabilities  desired  of  dynamically 
linked  objects.  Sum  was  included  to  provide  a  simple  example 
that  will  be  explained  in  detail. 

1 •  Test  Program  Construction 

t 

iefere  discussing  either  test  program  further,  it  is 
useful  to  explaiL  the  mechanics  used  in  their  construction. 
First,  because  a  translator  wnich  supported  dynamic  link  inf 
was  r.ot  available,  it  was  necessary  tc  hand  assemble  these 
portions  of  the  test  programs  unique  tc  dynamic  linking. 
These  included  translated  external  references,  symbolic  name 
tables,  end  linkage  table  templates  .  All  test  program 
source  listings  and  program  test  results  are  included  in 
a ppend ix  (C ) . 


1 8 

The  test  proprams,  including  template6,  symbolic  name 
tables,  and  relocation  bits  (for  executable  code)  were 
written  in  8£b0  assembly  code  and  assembled  usin^'  the 
Digital  nesearcn  8F&C  Assembler  [17], 
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a.  ^ne  Assembled  Symbolic  Name  Table 

The  symbolic  nare  table  cf  an  object  ran  be 
found  (in  tne  source  listings)  at  the  end  of  either  tne 
object  code  (for  procedures)  or  in  the  linkage  table 
template  (for  data).  Eacn  entry  in  the  symbolic  name  tabic 
consist  cf  four  field.  For  clarity,  each  field  was  prececed 
by  a  label.  Entries  were  of  the  following  form: 

TSSCn  :  DE  byte_l  19 

LIN E r.  :  E3  low_byte,  nign_byte 

ENTKTn  :  D3  low_b.yte,  high_byte 

NAMEn  :  EE  'CiJECT_NA,vE  :2NTF.Y_N Ai^E  '  or  'ZNTEY_NAVE  ' 

DESCr  represents  the  entry  descriptor  (of  the  nth  symbolic 
name  table  entry).  The  most  significant  bit  of  byte_l 
indicated  the  object  type  (viz.,  0  for  procedures  and  1  for 
data).  The  five  least  significant  bits  of  byte_l  contained 
the  number  of  characters  in  the  name  field.  Thc  remaining 
two  bits  of  EESCn  were  unused. 

I INKc  is  the  offset  of  the  entry's  outvoice  cr 
incoming  link  in  tne  parent  object's  linkage  table.  The 

ENTHTn  field  is  an  entry  point  offset  in  the  parent  object 

19 

DE  is  an  assembler  pseudo-opera  ter  that  tells  the 
assembler  that  the  rest  of  the  line  represents  data.  Tata 
not  surrounded  by  single  quotes  is  translated  as  a  r.urerical 
value  while  data  in  Quotes  is  an  ASCII  character  string. 

20 

In  the  £060,  two  byte  values  are  stored  in  memory  with 
the  low  byte  in  the  1  ow<=r  numbered  men  cry  location.  In  us  tne 
number  1C20E  would  appear  as  20F,  ICE  when  used  in  a  EE 
field. 


associated  with  some  entry  name.  ?or  an  external  reference, 
low_byte  and  hif?h_fcyte  of  this  field  were  arbitrarily  set  to 
zero . 

Tne  NA^En  field  r.eld  tne  symbolic  rare 
associated  with  tne  entry.  This  field  cortair.ed  either  an 
entry  name  (e.g.,  SNTRYw"l),  or  the  name  of  an  external 
reference  (e.e.,  OBJECTJIAMS  :ENTWY_M  A.vE  '  .  Tor  tne  *•!  A  v  Z  field 
of  an  external  reference,  the  'ENTFY_NAVE '  portion  is 
optional.  When  left  out,  it  implies  that  the  entry  r.arte  to 
he  used  is  the  same  as  the  object  name.  Tor  °\~rple.  the 
procedure  Y'lLT  has  an  entry  point  by  the  s-m--  rare  hut 
appears  as  'MULT'  in  ESMC's  symbolic  name  table  'vice 
'V’JLT:M'TLT 

h.  The  Assembled  Template 

The  linkage  table  template  was  constructed  as 
assembled  code.  Templates  were  of  the  form: 

SIZE  :  IB  lov_byte,  high_byte 

SNT  :  D5  low_byte,  high_byte 

BODY  :  EE  Pi,  Pi,  ii,  £*.,  Pi,  P£  (in-orin?  link' 

?TTSH  D  (  cu  tgc  i  ng  link) 

LXI  D,  symbol ic_name  table  offset 
F  S  T  4 

The  SIZE  field  contains  The  number  of  bytes  in 
the  template.  SNT  represents  the  offset  (i.e.,  number  of 
byte$)  of  the  symbolic  name  table  from  the  hemi  nnir.p-  of 
eith°r  a  procedure  segment  or  a  data  segment's  template. 
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Trie  BOD7  of  a  template  contains  two  types  of 
entries.  For  an  incoming  link,  the  template  merely  reserves 
six  bytes  (initialized  to  <Z)  in  the  combined  linkage  table 
in  which  the  snapped  incoming  link  will  eventually  be 
placed.  An  outgoing  link  consist  cf  three  assembly  code 
instructions.  The  first  instruction  (PUS K  D)  saves  tne 
argument  register  (viz.,  the  D  and  £  register  pair)  prior  to 
loading  tr.at  register  with  the  symbolic  rare  table  offset  cf 
the  external  object  to  be  linked.  The  third  outgoing  link 
instruction  (PST  4)  causes  a  software  fault  resulting  in  the 
invocation  of  the  linker  via  the  interrupt  vector. 

c.  Other  Problems  in  Test  Program  Ccrstructicr 

Eecause  the  P2P*  microprocessor  does  rot  nave  an 
indirect  addressing  CALI  instruction,  tne  transfer  of 
control  to  an  outgoing  link  (by  the  executing  procedure) 
deserves  explanation.  Recall  that  it  is  desired  tc  perfcrn 
tne  following : 

CALL  (Lp  +  outgoi ng_link_of fset ) 

To  achieve  this  in  PCS?  code,  the  following  sequence  of 
instructions  was  used: 

PUSH  P 

LXI  H,  return_address 

pUCU  u 

LYI  H,  outgoing  link  offset 

TAT  £ 

FCFL 

return  address  :  PC?  ? 
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The  first  instruction  (PUSH  B)  saves  tne  linkage 
pointer.  The  next  two  instructions  save  the  return  address 
(which  is  normally  done  automa tically  by  a  CALL 
instruction).  The  H  and  I  register  pair  is  then  loaded  with 
the  outgoing_link_of fset  and  added  to  the  I  and  C  register 
pair  (viz.,  the  linkage  pointer)  by  the  LAE  B  instruction. 
DAD  5  adds  the  1  and  C  registers  to  the  H  and  1  registers 
and  leaves  the  result  in  tne  H  and  1  registers.  The  value 
'Ip  +  outfroi  ns_li  nk_of  f  set '  is  in  now  jumped  to  by  the  PCHI 
instruction  (which  transfers  control  to  tne  address  stored 
in  the  F  and  L  registers).  The  final  instruction  (PDF  £' 
restores  the  linkage  pointer  upon  return  from  the  external 
procedure . 

Very  briefly,  a  relocation  bits  file  was 
constructed  by  hand  and  was  of  the  following  form: 

SIZE  :  DB  low_byte,  high_byte 

IClfe  :  DB  binary_number_l ,  bir.a ry_number_2 

The  SIZE  field  represents  the  number  of  bytes  in  the 

relocation  bits  file.  The  remainder  (of  the  file^  consisted 

of  two  binary  numbers  preceded  by  a  label  sucn  as  Iflte 

(where  ei <Z<i  corresponds  to  an  address  in  the  procedure 

object  code  listing).  A  'G'  in  a  binary  number  corresponds 

to  a  non-relo eatable  byte  of  object  code.  A  'l'  identifies 

the  byte  as  the  first  of  a  two  byte  relocatable  address. 
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The  Test  Frogram  DEMO 


Tne  address  space  of  DEMO  included  four  objects:  fl) 
the  procedure  segment  CEMO'nstration )i  ( 2 )  tne  procedure 
segment  MHIT(iply)  which  included  the  entry  point  'MULT'; 
(3)  the  procedure  segment  DISPLY  which  included  the  entry 
points  'KEX_7ALUE'  and  'RUFFEE ';  (4)  and  the  data  segment 
HEADER  which  included  the  entry  points  'HEADER'  end  'TITLE'. 

As  has  teen  noted,  DEMO  computed  and  displayed  the 
(hexidecimal )  multiplication  and  addition  tables  for  tne 
values  P  through  15.  The  construction  of  each  table  was 
performed  by  the  internal  (to  DEMO)  procedure  Tuild_teble 
which  is  passed  a  subroutine  as  a  parameter  (viz.,  AIL ,  an 
internal  procedure,  and  MULT,  an  external  procedure'.  ADC 
and  MULT  are  passed  (by  ruild_table)  a  number  that  is 
added/multiplied  by  0  tnrough  15.  The  result  of  the 
computation  is  displayed  by  invoking  the  external  procedure 
D1SPLY.HSX_VA.LUE.  Thus  to  build  a  hexadecimal  table 
3uild_table  simply  Invokes  either  ADC  or  MULT  sixteen  times 
passing  as  a  parameter  the  values  from  2  to  15. 

ref ore  building  a  table.  DEMO  displays  an 
appropriate  heading.  It  does  this  by  dynamically  linking  to 
the  data  segment  HEADER ,  inserting  tne  appropriate  title 
(viz.,  MULTIPLICATION  or  ADDITION)  at  the  entry  point 
HEADER . TITLE ,  and  then  displaying  HEADER  by  passing  it  as  an 
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argument  to  the  external  procedure  n SPLY .BUFFER . 

The  dynamic  linking  which  takes  place  during  the 
execution  of  DEMO  is  given  in  figure  14.  EEMO  includes 
examples  of  all  the  various  capabilities  (of  external 
objects)  desired  in  a  dynamic  linking  environment  including: 

(1)  The  ability  to  dynamically  link  and  execute  external 
procedures — DEMO  dynamically  links  to  and  invokes  IISPIY. 

(2)  The  ability  to  reference  external  data — DEMO  links  to 
and  references  EEaEER. 

(3)  The  ability  to  pass  external  objects  as 
arguments — EEAEER  and  MULT  are  passed  to  EISFLY  and 
3uild_table  respectively. 

(4)  The  ability  of  an  external  object  to  engage  in  dynamic 
linking— MULT  dynamically  links  to  DISPLY. HEX  JT  ALUE. 

(5)  The  implementation  of  entry  points  in  objects — DISPLY 
and  FEAEEP  both  are  referenced  via  entry  points. 

3 .  The  Test  Program  SUM 

The  procedure  SUM  was  included  to  allow  a  complete 
and  comprehensive  discussion  of  the  concepts  presented  in 
this  thesis.  SUM  itself  is  rather  simple.  It  dynamically 
links  to  tne  external  data  segment  ARRAV  and  sums  the  (data) 
bytes  of  ARRAY.  The  results  are  displayed  by  dynamically 
linking  to  DIS*LT.KEX_v ALUE  (passing  the  sum  cf  ARRAY'S 
bytes  as  an  argument).  DISPLT. BUFFER  is  also  invoked  to 
display  appropriate  messages  along  with  the  computation 
result. 
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A  pseudocode  listing  of  SUM  is  given  in  figure  15 
while  figure  16  presents  a  representative  assembly  code 
translation  of  SUM.  The  assembly  code  used  is  not  associated 
with  any  particular  microprocessor,  but  is  considered  within 
the  capabilities  of  most  microprocessor  instruction  sets. 
The  only  instruction  used  which  may  cause  confusion  is 
LLPAP.AM  (viz.,  load  parameter  register).  This  Instruction  is 
simply  a  register  load  but  the  pneumonic  IIFARAM  is  offered 
tc  signify  tne  passing  of  arguments  to  an  external 
procedure.  (Note  that  the  dynamic  linger  demonstration 
implementation  uses  the  D  and  S  register  pair  for  this 
purpose . 1 

The  combined  linkage  table  for  S UM  is  snown  in 
figure  17.  (The  figure  does  not  include  ARRAY. link  or  a  link 
for  D  IS  FL'T.  BUFFER) .  The  linkage  table  for  SUM  ircludes  an 
incoming  link  (entry  #1)  which  would  be  used  if  SUM  were 
referenced  as  an  external  object.  Entry  #2  is  tne  outgoing 
link  from  SUM  to  ARRAY  while  Entry  m3  represents  tne 
outgoing  link  from  SUM  to  DISPLY .KEX_VALUE. 

When  the  two  outgoing  links  of  SUM  are  <=nepped,  tne 
unlinking  data  is  included  in  the  snapped  link  and  includes 
the  symbolic  name  table  offset  of  APRAT  and  I)I??Iv.Fi‘X_7A IUE 
(in  SUM.syn)  respectively  and  tne  appropriate  linked  list 
pointers.  Unlinking  linked  lists  are  implemented  as  circular 
linked  list.  Thus  the  linked  list  for  EISPLY  starting  with 
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PROCEDURE  Sum? 


/v  Sun  adds  the  tytes  of  the  external  data  structure 
'Array'  and  then  calls  on  the  external  procedure 
'Tisply'  to  output  the  result.  */ 

DECLARE  Sum  ENTRY  POINT? 

Array  DATA  EXTSPNAI ? 

Disply  PROCEDURE  EXTERNAL ? 

result  :  BYTE? 
array_pointer  :  POINTER! 

data_array  BASED  at  array_pointei  STRUCTURE  of 
number  of  bytes  :  BYTE? 
data  :~A?.RAT  of  BYTES? 

END  ? 

i  :  BYTE? 

/*  end  of  declarations  #/ 

array_poir.ter  »  address  cf  array? 
result  =  C? 

FOR  1  *  1  to  data_array .number_of_bytes? 

result  *  result  +  data  array. data  (i); 

END! OR? 

CALL  aisply .buffer  ('The  sum  of  the  data  arr^y  is 
CALI  disply ,hex_buffer  (result)? 

/'•'  generate  a  carriage  return  and  line  feed  #/ 

CALL  disply. buffer  (  CP. ,  I.F,  '&'■? 

CALL  disply  .buffer  ('End  of  Sum', '<*'); 

END  Sum, 


PSEUDOCODE  rCS  SIV! 
FIGURE  1? 
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the  header  entry  (in  DI SPLY . link)  foes  from  offset  l?K  to 
UCH  (the  snapped  outgoing  link  from  SUM  to 
DIS?LY.HEy_VALUE) .  The  linked  list  pointer  at  (in 
SUM. link)  points  to  DISPLT's  linkage  address  table  entry 
which  in  turn  points  to  LIS?IV.  1  ink  (viz.,  r.IS?Iv.link 's 
header  which  contains  the  first  node  of  EISPIY's  linked 
list). 

The  assembly  code  for  SUM,  ARRAY,  and  TISPLY  is 
included  in  appendix  (C)  along  with  the  output  generated  by 
SUM,  the  process  reference  table,  and  the  combined  linkate 
table  also.  The  process  reference  table  and  combined  linkage 
table  are  annotated  to  provide  additional  clarification. 

4 .  Observations  on  the  Implementation 

a.  Size  of  the  Dynamic  linker  Implementation 

The  dynamic  linker  including  the  display  linkage 
table  and  display  process  reference  table  routines  was  83-^ 
bytes  in  length.  This  includes  IK  bytes  of  memory  statically 
allocated  to  the  combined  linkage  table  and  15F  bytes 
reserved  for  the  hardware  stack.  (It  should  he  noted  tnat 
additional  memory  was  allocated  to  the  ?L/w-8?  stack  segment 
to  prevent  stack  overflew  during  test  preerar  execution. 
This  was  necessary  since  the  PL/K-80  stack  is  allocated 
based  on  the  needs  of  the  dynamic  linker  and  does  not  take 
into  account  stack  operations  done  by  other  procedures  in  a 
process.)  It  is  emphasized  that  no  effort  was  make  to 
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optimize  the  object  code.  Instead,  the  dynamic  linker  was 
written  to  be  as  clear  and  obvious  as  possible. 

The  dynamic  linker  was  also  compiled  without  the 
display  linkage  table  and  display  process  reference  table 
routines  (which  were  included  for  the  purposes  of  the 
demonstration  only).  This  edition  of  tne  linker  was  6272 
bytes  in  length.  It  is  estimated  that  a  complete  (i.e., 
including  unlinking)  and  optimized  implemer ta ti on  of  the 
dynamic  linker  should  require  about  7222  bytes  of  object 
code.  It  is  noted  that  error  conditions  were  not  checked  for 
by  the  dvnamic  linker.  However,  since  there  are  essentially 
only  two  error  conditions  which  could  occur,  it  is  felt  that 
the  size  estimate  for  a  dynamic  linker  is  still  valid.  The 
error  conditions  which  may  occur  are  (1^  a  reference  is  made 
to  a  non-existant  entry  point  (Heferer.ces  to  non-existar.t 
files  are  flagged  by  the  library  routines.),  and  (2)  Tne 
statically  allocated  IK  combined  linkage  table  is 
overflowed.  Sues  problems  as  running  out  cf  free  rencry  or 
process  reference  table  entries  are  handled  by  the  unlinker, 
b .  Overnead  Associated  with  Snapped  links 

One  of  the  major  arguments  against  dynamic 
linking  is  the  issue  of  overhead  associated  with  snapped 
links.  Before  debating  tnis  must  tesue,  it  is  observed  that 
the  cost  cf  dynamic  linking  associated  with  snapping  a  link 
(i.e.,  the  first  reference  of  an  external  object)  is  on  the 


order  of  the  overhead  required  to  statically  link  the  same 
object . 

With  respect  to  snapped  procedure  links,  tne 
overhead  (associated  with  the  linker  implementation'!  lies  in 
two  areas.  First,  the  linkage  pointer  must  be  updated  tc 
always  indicate  the  executing  procedure's  linkage  table. 
Thus  the  linkage  pointer  must  be  saved  and  restored  for  each 
external  procedure  reference,  which  requires  an  additional 
two  instructions.  Additionally  the  linkage  pointer  is  set  tc 
point  to  the  (dynamically  linked)  external  procedure  by  the 
snapped  incoming  link,  which  requires  a  third  instruction. 
Secondly,  the  execution  point  goes  from  the  calling 
procedure  tc  tne  external  procedure  via  the  snapped  outgoing 
and  incoming  links.  This  requires  two  jump  instructions  not 
needed  for  internal  procedure  calls  thereby  trin^in?  the 
total  overhead  to  five  instructions.  It  is  noted  that  tne 
extensive  code  necessary  in  invoke  an  external  object's 
outgoing  link  is  considered  a  limitation  cf  tne  E?=£ 
(because  of  the  lack  of  an  -CSf  indirect  call  instruction' 
and  is  not  considered  overhead  induced  by  dynamic  linking. 

Recall  that  to  reference  external  data  (via  the 
out^oinp'  link)  a  call  to  the  outs-oiup  link  is  performed,  the 
virtual  address  of  the  data  is  loaded  in  a  pointer,  and  a 


return  instruction  (to  the  calling  procedure^  is  executed. 
Since  internal  data  is  essentially  referenced  by  loading  a 


VIII.  CONCLUSIONS 


Based  on  the  research  supported  in  this  thesis  it  is 
reasonable  to  assert  that  dynamic  linking  is  feasible  in  a 
microcomputer  environment.  However,  given  that  the  linker 
design  is  implementable  on  microprocessors,  it  can  be 
asserted  that  dynamic  linking  does  not  require  the  support 
of  specialized  hardware  and  thus  can  be  feasibly  implemented 
on  most  general  purpose  computers  (including  minicomputers 
and  main  frames).  The  overhead  is  within  reason  and  can  be 
far  outweighed  by  the  derived  benefits.  It  has  been  implied 
[9,  13,  14]  that  dynamic  linking  requires  the  support  of 
specialized  hardware.  It  is  felt  that  the  major  contribution 
of  this  thesis  is  to  dispell  that  notion. 
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APPENDIX  A  -  PSEUDOCODE 


EXECUTIVE  linker; 


Explanation  of  variables  and  constants  : 

En_buffer  -  The  entry  name  buffer  is  a  string  variable 
where  the  entry  name  associated  with  an  external 
reference  is  stored  once  the  entry  nape  has  been 
extracted  from  the  calling  procedure's  symbolic  name 
table . 

Fixed_?n_of f set  -  The  fixed  symbolic  name  offset-  is  a 
constant  which  represents  the  number  of  bytes  in 
that  portion  of  a  symbolic  name  table  entry  that 
does  not  vary  in  size  (i.e.,  the  descriptor,  link 
offset,  and  entry  point). 

Free_link_table  -  The  free  linkage  table  variable  is 
the  next  free  location  in  the  combined  linkage  table 
where  new  (object)  linkage  tables  can  be 
constructed. 

In_l ink_address  -  The  incoming  link  address  is  .he 
virtual  address  of  the  incoming  link  for  the 
<external_procedure ! entry_name>  being  linked. 

Incoming! ink  -  The  incoming  link  structure  represents 
the  format  of  an  incoming  link.  Incoming  link  is 
based  at  the  incoming  link  address. 

Iinkage_ptr  -  The  linkage  pointer. 

Linkage_a r ray  -  Iinkage_a rray  is  a  linkage  table 
structure  based  at  the  linkage  pointer. 

New_link_ptr  -  The  new  linkage  pointer  is  assigned  tne 
value  of  the  linkage  pointer  of  the  external  object 
being  linked. 

New_link_table  -  The  new  linkage  table  is  a  linkage 
table  structure  (of  the  external  object  bee-in 
linked)  based  at  the  new  linkage  pointer. 

Ob  ject_sei?_number  -  Object  segment  number  is  the 
segment  number  assigned  to  the  external  object  being 
linked. 
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Object_type  -  Object  type  represents  whether  the 
external  object  begin  linked  is  a  procedure  or  data. 


Out_link_address  -  Tne  outgoing  link  address  is  the 
virtual  address  of  the  outgoing  link  assigned  to  the 
<external_object !entry_name>  being  linked. 

Outgoing_l ink  -  The  outgoing  link  based  at  the 
outgoing  link  address. 

Fn_buffer  -  The  symbolic  name  buffer  is  a  string 
variable  where  the  symbolic  name  of  the  external 
reference  is  stored  once  the  symbolic  name  has  teen 
extracted  from  the  calling  procedure's  symbolic  name 
table . 

Sn_address  -  The  symbolic  name  address  is  a  pointer 
into  a  symbolic  name  table. 

Sn_item  -  A  symbolic  name  item  is  a  structure  based  at 
the  symbolic  name  address  and  represents  an  entry  in 
a  symbolic  name  table. 

Sn_offset  -  The  symbolic  name  offset  is  the  parameter 
passed  the  linker  and  is  the  offset  into  tne  calling 
procedure's  symbolic  name  table  of  the  external 
reference  to  be  linked. 

Sn_size  -  The  symbolic  name  size  is  the  numbpr  of 
character  in  an  <external_ref erence ! ent ry_name'  as 
found  in  a  symbolic  name  table  entry. 

Sn_size_mask  -  The  symbolic  name  size  mask  is  used  tc 
extract  the  size  of  a  symbolic  name  from  a 
descriptor  in  the  symbolic  name  table  of  the  calling 
procedure. 

Sn_type_mask  -  The  symbolic  type  mask  extracts  tne 
type  of  an  external  reference  (i.e.,  procedurp  or 
data)  from  the  descriptor. 

Target_address  -  The  target  address  is  the  ultimate 
virtual  address  in  an  external  object  which  the 
calling  procedure  seeks  to  reference. 
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TeiPDlate  s eg  number  -  The  template  segment  number  is 
the  segment  number  assigned  to  the  lir.Kaee  table 
template  when  it  is  entered  in  a  process  address 
space . 

Template  -  Template  is  a  linkage  table  template 
structure  based  at  template  segment  number. 

Type  data  -  Type  data  is  a  constant  which  is  used  to 
identify  external  data  objects. 

Type_procedure  -  Type  identifies  external  procedure 
procedure  objects. 

/#  end  of  variable  explanations  */ 


/*  explanation  of  declaration  types  */ 


ADDRESS  -  a  virtual  address. 

EYTE  -  the  contents  of  a  virtual  address. 

CHARACTER  -  an  ASCII  character. 

INTEGER  -  a  variable. 

POINTER  -  an  address  variable  which  points  to  a 
user  defined  data  structure. 

STRUCTURE  -  a  Pascal  record. 


/*  end  of  explanation  of  declaration  types  */ 
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/*  The  following  is  a  list  of  variable  and  constant 
declarations  used  in  ttie  linker.  */ 

DECLARE 


In  buffer  :  STRUCTURE  of 

size  :  INTEGER  ? 

naire  i  ARRAY  of  CHARACTERS* 

END? 

?ixed_Sn_off set  :  INTEGER  CONSTANT; 

Free_Iink_table  :  ADDRESS; 

In  link  address  :  POINTER; 

Incoming  link  :  STRUCTURE  BASED  at  In_l  ir.K_add  ress  of 
Link_snapped_bi t  :  BITE; 

Load_Ip  :  INTEGER  5 
Jump  i ns t  :  INTEGER; 

end;- 

Linkage  ptr  :  POINTER; 

linkage  array  :  STRUCTURE  BASED  at  Linkage_ptr  of 
Size  :  INTEGER; 

Snt_address  :  AEr?.ESc; 

Body  :  ARRAY  of  BYTE; 

end; 

Lickage_address_table  i  ARRAY  of  ADDRESS; 

New  link  ptr  :  POINTER? 

New-link“table  :  STRUCTURE  EASED  at  New_lir.k_ptr  of 

Size  :  INTEGER; 

Snt_address  :  ADDRESS* 

Body  J  ARF-AY  of  BYTE; 

end; 

Ob ject_se«_numbe r  :  ADDRESS; 

Object_type  :  BYTE? 

Out  link  address  :  POINTER* 

Outpoin?"*l ink  :  ARRAY  of  INTEGER 

BASED  at  Ou t_l ink_addres s ; 

Sn  buffer  :  STRUCTURE  of 

size  :  INTEGER* 

name  :  ARRAY  of  CHARACTERS? 

end; 
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Sn_address  :  POINTER; 

Sn^item  :  STRUCTURE  RASED  at  Sn_address  of 
descriptor  :  BYTE; 
name  :  ARRAY  of  CHARACTERS; 
lini_of f set  :  INTEGER; 
entry  point  :  INTEGER; 

end; 

Sn  offset  :  INTEGER; 

Sn“size  :  INTEGER; 

Sn^sizejnask  :  BITE  CONSTANT; 

Sn”type ~ma sk  :  BYTE  CONSTANT; 

"par£et_address  :  ADDRESS; 

Templa te_se*_number  :  POINTER? 

Template  :  STRUCTURE  BASED  at  Template_sep  numter  of 
Size  :  INTEGER; 

Snt  offset  :  INTEGER; 

Bodv  :  ARRAY  of  EYTI » 

end; 

Type_data  :  BYTE  CONSTANT? 

Type_procedure  :  EYTE  CONSTANT? 

END  of  declarations; 


/*  linker  Control  Module  */ 


BEGIN 

/*  Save  processor  registers  if  necessary  */ 

CALL  Access_Symbolic_Name_Data; 

PARAPET ER_IIST  :  Sn_cffset,  Sn_buffer,  En_buffer. 

Linkage  pointir; 

RETURN_LIST  :  Sn_address,  En_buffer,  Sn_bu?fer, 

Object_typef  Cut_link_address ; 


/xt  asm  Make  Accessable  calls  on  the  Address  Space  Manager 
to  add  the  object  found  in  Sn_tuffer. nare  to  the 
process  address  space  and  return  tne  segner.t 
nurrber  assigned  to  that  object.  */ 

CALL  AS M_Make_Acces sable; 

PARAMETER  LIST  :  Sn  buff er. name? 

?.ETURN_LI  ST  :  Ob ject_see_number ; 

CALL  Linkage_Table_P.outines » 

PA?.AMETER_LIST  7  Ob jec t_se*r_numbe r ,  Link_address_table 

Pree_link_table,  Nev_lir.k_ptr  ; 

RETURN _LI ST  :  New_Iink_ptr ,  Iin'<_address_tabl»i , 

Free_link_table; 

CALL  Access_Entry_Name_Cata; 

PA?.AME?E?_I 1ST  :  Sn_address,  En_buffer,  New_link_ptr, 

Object_type,  ObJect_see_nur:ber ; 
?ETTTPN_LIST  :  Target_address ,  In_lir.k_address; 

CALL  Snap_the_Links ; 

FAEA^ET E?._LIST  :  In_link_address,  Out_lir.k_cddress . 

New_link_ptr,  Object_tyte, 

•  Target  address; 

RETURN  LIST  :  None; 


/**  restore  processor  registers  if  necessary  */ 
JUMP  to  Out_link_address? 

END  Linker; 
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Access_Symbolic_Name_Data  performs  the  following  functions: 

1.  Obtains  the  address  of  the  symbolic  name  of  the 
external  reference  being  linked. 

2.  loads  the  symbolic  name  of  the  external  reference 
in  the  symbolic  name  buffer  (S n_Ji:f fer )  . 

3.  loads  the  entry  name  of  the  external  reference 
in  the  entry  name  buffer  (En_buffer). 

4.  Computes  the  outgoing  link  address  and  determines 
whether  the  external  object  is  a  procedure  or  data. 

3*3!C3il*«V*3jl#*#3(C  ******)!'«*««*  J*Jjc  W  $**5*  VWS?  WWWW>?>itnt  W  / 

PEOCEDHRE  Acces  s_Symbolic_Name_Ba  ta  ; 

PARAV.ET3R_IIST  :  Sn_offset,  Sn_buffer,  En_bu?fer, 

Iinkage_pointer? 

DECLARE  i,  temp  :  INTEGER? 

Sn_address  =  Linkage_array. Snt_address  +  Sn_offset? 
Sn_size  *  Sn_item.size  AND  Sn_s i ze_mask? 

/*  load  the  symbolic  name  into  Sn_buf fer  .name .  */ 

i  ®  1  * 

DO  WHILE  (Sn_item.name(i)  <>  AND  (i  '  =  Sn_slze'? 

Sn  buf f er.name( i }  =  Sn  i tem.name(i  )? 
i  =  1  +  1? 

entweile; 

/*  Load  symbolic  name  size  into  Sn_tuf f er .  si :e .  *  ' 

IF  i  =  Sn  size  TEEN  Sn  buffer. size  =  i? 

ELSE  Sn~buffer .size  =  (i-i>; 


/*  load  the  entry  name  buffer  with  th<=>  entry  name. 

I?  i  =  Sn_size  mK2N  BEGIN 

/*  No  entry  name  specified,  default  to  the 
symbolic  name  as  tne  entrv  name.  */ 

rn  buffer. size  =  Sr.  size,' 

FOR  i  =  1  to  Sn_size  by  1  TO 

En_>'uffer  .name  (i  )  =  ?n_iten. named  )  5 

endthsn; 

ELSE  BEGIN  /*  entry  name  specified  */ 
temp  *  ii 
i  -  i  +  i; 

/*  load  size  of  entry  name  in  3n_buf fer. si ze  */ 

Sn_Buf f er.si ze  =  Sn_size  -  i? 

/*  load  entry  name  into  entry  name  buffer  *  t 

DO  WHILE  (i  <=  S n_si ze  ) 

2n  Euf fer .name ( i  -  temp'  *  Sn  item. name ( i )» 
i  =  i  +  i; 

endwkile; 

endeise; 

/*  Compute  the  address  of  the  outgoing  lir.’<  and 
determine  the  type  (procedure  or  data)  of  the 
external  object.  */ 

Out_l ink_address  *  Linkage_poin ter  + 

Sn_itemTlir.jt_offset; 

Object_type  ■  Rn_item. descriptor  AND  Sn_type_rasl«:; 


RETU?N_LIST  :  Sn_address,  En_tuffer,  Sn_trffer, 
Otject_type,  Out_lin'<_addre«.s  J 

END  Access_Synbolic_Name_Data; 


\ 


pTfifK'r'W’Z 

Linka^eJTable^outines  performs  the  following  furctior.s: 

1.  Eetermines  if  a  linkage  table  already  exist  for  the 
external  reference  tein*-  linked. 

a.  If  not,  Lirkage_Tabie_Rcutines  initialized  the 
Linkage  Address  Table  value  for  tne  object  and 
then  calls  on  Euild_Ob ject.link. 

h.  If  so,  Linkage_Table_?.outines  sets  a  return 
parameter  ( New”link_ptr )  equal  to  tne  linkage 
pointer  value  for  the  new  object's  linkage  table. 

^w^w*#*:***#***  *>?*«*«  *#**>!<  **3  *>)<  *  *1*  v  #  >!«  v  wv  v  wv  ttigcv  wpn  ***  / 


PP0CSEUP2  Linkage_Table_?.outines  ; 

FA?AMET2R_LIST  :  ObJect_se<?_nurrber,  Lir.k_aidress_table , 

Free_llnk_tatle ,  New_link_pt rl 

IF  Link_address_table  (Ob jer  t_seg_r.umb',r )  =  nil  TH2A 

/*  This  is  the  first  time  tne  object  has  teen 
referenced  by  the  process  and  tne  linker  must 
build  a  linkage  table  for  the  object.  */ 


Link_address_table  (Ob ject_seg_nurrter)  = 

F ree_li nk_table  J 

CALL  3uild_0bject . link; 

FA?.AMETER_LIST  :  Object_type,  Free_l i  rk_  table  , 

Sn_buffer,  Ob ject_sep_Hurber ; 
HETTJP.N_LIST  :  New_l  ink_ptr ,  Free_link_table  ; 


enetfen; 

ELSE 

/*  The  object  already  has  a  linkage  table.  */ 

New_link_ptr  =  Link_address_tatle  ( Object_seg_numter) ; 

RETUEN_LIST  :  i\’ew_linK_ptr ,  Link_address_table , 
Free_link_table> 

END  Linkage  Table.^ou tines » 
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Bui  ld_Ob  ject .  li  nk  performs  tne  following  functions: 

1.  Causes  the  Address  Space  vana;?er  to  load  the  external 
object's  linkage  table  template  into  the  process 
address  space. 

2.  Initializes  a  return  parameter  ( New_li nk_pt r )  to  the 
value  of  the  object's  linkage  pointer. 

3.  Appends  Object .link  to  the  end  of  the  combined  linkase 
table. 

4.  Deletes  the  linkage  table  template  from,  tne  process 
address  space. 
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PROCEDURE  Euild_0b jec  t .link; 

?AEAmSTEE_LIST  :  Cbject_type,  Free_l  ink_table,  Sr._ tuffer, 

Ob Jec  t_seg_numbe  r; 

DECLARE  i  :  INTEGER,* 


/*  mhe  following  twc  steps  cause  Sn_buffer .  ramc  to  be 
loaded  with  the  filename  <s/rrbolIc  name . template> 
and  then  invokes  the  Address  Space  manager  to  nave 
the  template  loaded  into  the  process  address  space 
(with  ASM_Make_AcCessable  returning  the  segment 
number  assigned  to  tne  template.  */ 

APPEND  'template'  to  Sn_buf fer. name ; 

CALL  ASf‘,_Make_Accessable »’ 

PA^AmETSP^IlST  :  Sn_buf fer . name ; 

RETURN_LIST  :  Template_seg_numter; 

New_link_ptr  =  Free_link_table; 


IF  Object_type  =  Type_procedure  THEN  EEC-IN 

/*  If  the  object  is  a  procedure,  then  it?  symbolic 
name  table  is  in  the  object  code  segment.  */ 

New_link_table.Snt_address  =  Template .Snt_offset 

Ob Jec t_seg_ number; 


ENDTHSN; 


+ 
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ELSE  EESIN 


/*  if  the  object  is  data,  then  its  symbolic 
table  is  in  its  template.  *V 


New  link  table  .Snt_address  =  Np>w_link_ptr  + 

Template .Snt_ 

endflse; 

New_link_table.Size  =  Template. Size,' 


FOR  i  =  0  to  ( Templa te .Si ae  -  2 )  by  1  CO 

New_link_table.Fody  (H  =  Tempi  a te  .Body  (i) 

endfor; 

Free_link  table  =  ?ree_link_table  +  1  + 

Template  .Size? 


CALL  ASM_Remove_Segj 

PARAMETER  LIST  :  Sn  buffer. name ; 
RETURN_L I  ST  :  None; 

RSTURN_LIS T  :  New_l i nk_pt r ,  Free_l ink_table ; 
END  Build _Object. link; 


name 


offset ; 
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Access_Ent ry_Name_Ea ta  performs  the  following  functions: 

1.  Computes  the  target  address  in  the  external  otject  to 
be  utilized  in  the  linkage  process. 

2.  Computes  the  incoming  link  address  (if  applicable^. 

v V V # V Sp asejje f? r.s W :;t K- J? -t s.s >;; v :? s;t »;; -.f ■•<■  y s;: s;: :,t y Xf / 


PROCEDURE  Access_Entry_Name_Eata ; 

PAPA^ETE?._LIST  :  Sr_address,  Sn_buffer,  New_  li  nk_p  t  r  , 

Object_type,  Ob  jec  t_s  eg_nrrf  ter  ; 

/*  Get_Next_Sn_i tern  causes  Sn_address  to  point  tc  the 
next  entry  in  the  external  object's  Symbolic  Name 
Table.  */ 


PROCEDURE.  Ge  t_Nex  t_S  r._i  tem  (Sn_address }  J 
Sn_address  =  Sn_address  +  Fixed_Sn_of fset 

*  (Sn  item.descrintor  AND  Sn  ?ize_mask)> 
RETURN  Sn_address ; ~ 

ENE  C-e  t_Next_Sn_i  tem 

/v  Begin  Access_Entry_Name_Ea ta .  ■■•  / 

Sn_address  =  Nev_link_table.$nt_address ; 

EO  WHILE  Sn_itpm.name  <">  Sn_buff er. name ; 

CALL  Get_Next_Sn_i tern? 

r’arget_address  =  Object_seg_r.umber  Sn_i  tem.ent  ry_pcint 

IF  Object_type  =  Type_pr ocedu re  THEN 

In_lir.k_addre?s  =  New_link_ptr  -1-  Sn_i  ten.  lir.k_off  set  j 

RETURN_LIST  :  Ta rge t_address ,  In_l i nk_address ; 

END  Access_Entry_Name_Data; 
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/  r.t  >s=P  s*  V  JJestiy  V*  ^!5p«ss;ssX  sjtjjc^sasr.s  >|sip  tf.  :sxt*5# -f  V  W  s*  >}tV"??V  =*  #  *5?=*  *f>f 

S  r.ap_the_Li  nks  performs  the  following  functions: 

1.  Snaps  the  outgoing1  link  and  incoming  link  for  a 
procedure  object. 

2.  Snaps  the  outgoing  link  for  a  data  oMect. 


PROCEDURE  Snap, tne_I inks » 

FARAMETER_LIST  :  I  n_l  i  mc_aidres s  ,  Ou  t _  1  i  r.k_address  , 

Nev_link_ptr,  Object, type, 

,T’arget_address  ? 

IF  C t  jec  t_type  =  Type_procedure  THEN  b EG  I N 

/"'  Snap  a  link  for  an  external  procedure.  */ 

Cutgoing_link  (?)  =  "Jump  to'  In_link_address ? 

IF  Incoming_link.link_snapned  bit  is  unsnapped  THEN 
BEGIN 

Incomir.f_link.Load_Ip  =  "load  Ip'  Nev_l ir.k_p t r ? 
Incoming~link. Jumu  inst  =  'Jump  to'  Target  address? 
enethen; 
enttken; 

ELSE  BEGIN 

/■•'  Snap  a  link  for  external  data.  */ 

Outgoing_lir.k  (?)  =  'load  pointer'  Tari'et_addres  s » 
Cuteoinsr  link  (l)  =  "Return  instruction'; 

eneelse; 

END  Snap, the_L i nks ? 
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EXECUTIVE  Address  Space  Manager; 

/:.« 

Explanation  of  variables  : 

?RT_size  -  Tne  size  of  the  process  reference  table. 

Ses_number  -  The  segment  number  assigned  to  a  npwly 
loaded  object  by  the  procedure  load_Cbject . 

PRT  _  rr.ijg  process  reference  table. 

/*  end  of  variable  explanation  */ 

/*  The  following  is  a  list  of  variable  and  constant 
declaration  used  in  the  Address  Space  Manager.  “/ 

DECLARE 


P?.T_size  :  INTEGER? 

Seg_Nurber  :  ADDRESS? 

PRT  :  ARRAY  of  STRUCTURES  of 
Valid  bit  :  BOOLEAN; 

Name  T  ARRAY  of  CHARACTERS? 
Seg_numter  :  ADDRESS? 

end; 


END  of  declarations; 


/ r?. X* j? 5J! SS xs  Xf  s*  * s;s  >js  * V  *»*f  x,i  V  V »*  v  V S?  V  V *s  V t Xt *fi*t V  x,t X« X< #  V Xt  X:  X«X= *?•  X= X«X:x:  sitjjt Xt  VJJS Xs 

ASM_;v‘ake_Accessable  nerforrs  the  following  functions: 

1.  Determines  if  the  object  passed  as  an  arguments  is 
already  in  tne  process  reference  table  (i.e.,  is 
already  in  the  process  address  space). 

2.  If  not,  loads  the  object  into  memory  at  the  nett 
available  memory  location  and  updates  the  Frocess 
Reference  Table  (PRT). 

3.  Returns  to  the  point  of  call  the.s°gmen't  number 
assigned  to  the  object. 

W#  »***#>£#**##  WJjs  #&&&##  WXs  ttzw  / 


PROCEDURE  AS M_Make_Acces sable? 

?AEAMETEE_IIST  :  Object_name? 

DECLARE  i  :  INTEGER? 

found  :  BOOLEAN? 

i  =  1? 

found  =  false? 

/*  check  to  see  if  Object_name  is  in  the  PRT  */ 

DO  WHILE  NOT  found  AND  i  <=  FRT.size? 

IF  PPT ( i ) . vali d_bi t  =  valid  TEEN  BEGIN 
IF  pT>T{i).name  =  Object_name 
THEN  found  *  true? 

ELSE  i  =  i  +  1? 

ENDTEEN? 

ENDVEILS? 
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IF  NOT  found  TEEN  BEGIN 

/*  find  a  free  PPT  entry  *=*/ 

i  =  1  f 

DO  WHILE  PRT(i). valid  bit  =  valid; 

i  =  i  +  1J 
endweile; 

CALL  Load  object? 

PARAMETER  LIST  :  Object.name? 
F.ETURN_LIST  *•  Seg_Number? 

PPT(i  )  .name  =  Object_Name? 
?RT(iK  se?_number  =  Se  ^Number? 
PPT(i ) ,valld_bit  =  valid? 

ENDTHEN  ? 

EETURN_LIST  :  P5T( i ) .see_number? 

END  ASM  (Make  Accessable? 
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ASM^Remo ve_Seg  performs  the  following  functions  : 

1.  Removes  an  object  from  a  process  address  space. 

*  s}e  sj«  *  V  V#  3  *  #  V  Sit  7f  TjtXf  sjt  >91 W  «  *  $  «  Jit  *  *  Sf  *>*  ?  $  9  $  $  $  #  V  ?  $  #»*  v  #  **  V  #  *  S« *  #  *  *  *  *  V  / 


PROCEDURE  AS^_Pemove_Seg» 

FARAMETER_L IS T  :  Object_namei 

ESCIAF. E  i  :  INTEGER? 

found  :  EOOLEAN; 

/*  find  Cbject_name  in  the  process  reference  table  (?RT)  */ 
i  =  i; 

found  *  false? 

TO  WHILE  NOT  found  AND  i  <=  ?RT_size; 

IF  FRT(i).name  *  Object_name 
THEN  found  =  true; 

ELSE  i  =  i  +  I? 

sndwhile; 

/*  remove  the  object  from  the  PRT  */ 

PRT ( i ) . val id_bi t  *  invalid; 

RETTJRN_LIST  :  none? 

END  ASM_,emove_seg; 
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PL/M-80  COMPILER  PROCESS  INITIALIZATION 


APPENDIX  B  -  DEMONSTRATION  SOURCE  LISTINGS 
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/***  PROGRAM  VARIABLES  ***/ 


PL/M-80  COMPILER  PROCESS  INITIALIZATION 
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171  !  READ$BISK  :  PROC  (BUFFER*ADDR )  PTTE  PUBLIC; 

172  2  DCL  BUFFER$ADBR  ADDR, 

temp  bite; 

173  2  CALL  M0N1 (26, BUFFER $ADDR); 
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The  System  Foutines  invoke  the  CP^M  operating  system 
to  perform  their  respective  functions.  This  entails 
calling  tne  subroutines  nonitor_l  (mor.ll  and 
monitor_2  (mon2).  monl  and  mon2  very  simply  transfer 
control  to  the  CF/M  operating  system:  via  a  jump  vector 
located  at  25K .  The  pseudocode  for  monl  and  mon2  is 
as  follows: 

mon/mon2  :  PRCCEEURE  {functior._numter,  eminent )  * 

DECLARE  function_number  BYTE, 
argument  AErF.FSS  , 

load  the  C  register  with  function_numher 

load  tne  E  &  E  register  with  ar^unent 

jump  to  the  CP/M  entry  point  /#  location  C5K  V 

/v  CP/M  new  performs  tne  desired  function  as 
determined  by  the  funct ion_number  ani 
areunents  */ 

return  byte  value  in  the  HAL  ree  /*  mon2  only  -”/ 
end  monl 


;  the  following  is  the  assembly  code  for  monl  and  non?. 
ORG  F1B0B 

CSEG  Icsef:  tells  the  assembler  to  produce 

^relocatable  code 
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THE  PROCESS  REFERENCE  TAP  IE 
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oe: 

ECT  NAvT’ 

-  DEMO.COM 

EARS  ADDRESS 

-  S6S9 

2 

OEJECT  NAME 

-  HEADER. DTA 

EASE  ADDRESS 

-  9211 
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OEJECT  NAME 

-  LISPLY.COM 

EAS 
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-  MULT . CGV 
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LINKAGE  TAILS  1  (Ip  *  7C3D  )  (DIM®) 


SIZE  -  35 
SN?  -  8966 
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LINKAGE  TA3IE  2  (Lp  *  7056  )  Q  HEADS*-) 


SIZE  -  25 


S  NT  -  7f7fc 


DATA  S^MPOLIC  NAME  TABLE  (ATDPESS  -  7k  7f) 

EES CBIPTCE  -  P6H 
LINK  OFFSET  -  Z 
ENTRY  POINT  -  Z 
NAME  -  FEADER 

DESCRIPTOF  -  05E 
LINK  OFFSET  -  Z 
ENTRY  POINT  -  17 
NAME  -  TITLE 
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J?*p  TO  7fS6 


M 


A'>ETiC  ?'JV  6 

D T N A v I C  LINKED  VERSION  l.f 

c£  c'tv  TFE  TATA  ARRAY  IS  4A 
ND  OF  SOM 


TFR 

PROCESS  REF F® 

1  : 

05 JEC'r  NA^F 
PAS?  AIiDRrSS 

?.  : 

OP  SECT  NAVF 
PA5?  AII'REcS 

*7 

QPJECT  NA^F 
PASE  AtEF.SSS 

4 

MO  entst 

5 

NO  ENTRY 

<3 

NO  ENTRY 

7 

NO  EN^PY 

ft 

NO  ENTRY 

9 

NO  ENTRY 

1? 

NO  EN TF'r 

11 

NO  ENTRY 

12 

NC  EM TFT 

13 

NO  ENT®'* 

14 

NC  ENTRY 

15 

NO  ENTRY 

15 

NO  ENTPr 

SUP.COM 
cc  °9 

ARRAY .  ETA 
9^53 
nspi* 
9339 


r  ^  vi 
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Ill'll 


1 


W 


THE  CAMPINED  I  INK  AGE  TABLE 


LINKAGE  TABLE  1  (Lp  =  7030  ^  (5**) 


SNAPPED  DATA  I  INK  (ADDRESS  -  n?^Z) 


SNAPPED  PROCEDURE  LINK  (ADDRESS  -  7045; 


snapped  procedure  dink  (address  -  7052) 


LINKAGE  TABLE  2  (Lp  =  ?056  )  (AR.R.AY) 

i _ _ _ _ _  i 

!  SIZE  -  14  ! 


SNT  -  '’060 


DATA  SYMBOLIC  NAME  TABLE  (ADDRESS  -  7050) 

DESCRIPTOR  -  05K 
LINK  OFFSET  -  Z 
ENTRY  POINT  -  0 
NAME  -  ARRAY 


SIZE  -  25 


SNT  -  8677 


ONS NAPPED 
INCOMING  LINK 


10 AD  90  93 


RETURN 


JLTvip  TO  7e&l 


JEM?  TO  7075 


LINKAGE  TABLE 


{Lp  -  7£71  ) 


(  MSPL/) 


1 

1 

<n  | 
«-i  1 
'i  1 

ij1  l 

i 

-  16 

“  1 

1 

1 

1 

S  NT 

-  3457 

1 

f 

LOAD 

Lr  7C71 

1 

l 

1 

INCOMING 

LINE 

( ADEPTS S 

jT!V? 

TO  939*= 

1 

1 

LOAD 

LP  ?t?71 

1 

1 

1 

INCOMING 

LINK 

( ALEUTS S 

JUMP 

TO  9423 

1 

1 

7C75 ) 


71  El ' 
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